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A New Storage Element Suitable for 


Large-Sized Memory Arrays— 
The ‘T'wistor 


By ANDREW H. BOBECK 


Three methods have been developed for storing information in a coinci- 
dent-current manner on magnetic wire. The resulting memory cells have’ 
been collectively named the ‘‘twistor”’. Two of these methods utilize the strain 
sensitivity of magnetic materials and are related to the century old Wertheim 
or Wiedemann effects; the third utilizes the favorable geometry of a wire. 

The effect of an applied torsion on a magnetic wire is to shift the preferred 
direction of magnetization into a helical path inclined at an angle of 45° 
with respect to the axis. The coincidence of a circular and a longitudinal 
magnetic field inserts information into this wire in the form of a polarized 
helical magnetization. In addition, the magnetic wire itself may be used as 
a sensing means with a resultant favorable increase in available signal since 
the lines of flux wrap the magnetic wire many times. Hquations concerning 
the switching performance of a twistor are derived. 

An experimental transistor-driven, 320-bit twistor array has been built. 
The possibility of applying weaving techniques to future arrays makes the 
twistor approach appear economically attractive. 


I. INTRODUCTION 


A century ago Wiedemann! observed that if a suitable magnetic rod 
which carries a current is magnetized by an external axial field, a twist 
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of the rod will result. The effect 1s a consequence of the resultant helical 
flux field causing a change in length of the rod in a helical sense. Con- 
versely, it was also observed that a rod under torsion will produce a 
voltage between its ends when the rod is magnetized (see Fig. 1). 

Recently, during an investigation of the magnetic properties of nickel 
wire, it was observed that a voltage was developed across the ends of a 
nickel wire as its magnetization state was changed. Both the amplitude 
and the polarity of the observed signal could be varied by movements 
of the nickel wire. Most surprising, the amplitude of the observed voltage 
v2 of Fig. 2, was many times that which would be expected if a con- 
ventional pickup loop were used. 

After determining experimentally that the observed voltage was 
generated solely in the nickel wire and was not a result of air flux coupling 
the sensing loop (nickel wire plus unavoidable copper return wire), it | 
was concluded that the flux in the nickel wire must follow a helical path. 
This suggested that torsion was the cause of the observed effect, a con- 
clusion verified experimentally. The direction of the applied twist de- 
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Fig. 1 — Observation of an internally induced voltage vz generated by a mag- 
netic wire under torsion. 
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_ Fig. 2— Comparison of the internally induced voltage v2 to the voltage 1 
induced in the pickup loop. 
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termined the polarity and the amount of the twist determined the mag- 
nitude of the observed voltage. 

As a consequence of these results, it is possible to build mechanical- 
to-electrical transducers,” transformers with unity turns ratio but possess- 
ing a substantial transforming action, and a variety of basic memory 
cells. 

This paper will be concerned with a discussion of the memory cells 
from both a practical and theoretical viewpoint. It will be shown how 
these cells can be fabricated into memory arrays. One such configuration 
consists solely of vertical copper wires and horizontal magnetic wires. 
Experimental results of the switching behavior of many magnetic ma- 
terials when operated in the ‘‘twisted” manner will be given. 


II. A COINCIDENT-CURRENT MEMORY CELL — THE TWISTOR 


Consider a wire rigidly held at the far end and subjected to a clockwise 
torsion applied to the near end. This will result in a stress component of 
maximum compression’ at an angle of 45° with respect to the axis of the 
wire in the right-hand screw sense, and a component of maximum tension 
following a left-hand screw sense. All magnetic materials are strain 
sensitive to some degree. This will depend upon both the chemical com- 
position and the mechanical working of the material. For example, if 
unannealed nickel wire is subjected to a torsion, the preferred direction 
of magnetization will follow the direction of greatest compression, as 
would be predicted from the negative magnetostrictive coefficient of 
nickel. Unannealed nickel wire, then, will have a preferred remanent flux 
path as shown in Fig. 3. 

If the ease of magnetization as measured along the helix is sufficiently 
lower than that along the axis or circumference, it is possible to insert 
information into the wire in a manner somewhat analogous to the usual 
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Fig. 3 — Relationship of the mechanical stresses resulting from applied torsion 
to the preferred magnetic flux path in nickel. 
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coincident-current method. Consider a current pulse 7; applied through 
the nickel wire in such a direction as to enhance the spiraling flux, and 
a second current pulse J: applied by means of an external solenoid (see 
Fig. 4). Coincidentally, the proper amplitude current pulses will switch 
the flux state of the wire; either alone will not be sufficient. To sense the 
state of the stored information it is necessary either to reverse both cur- 
rents, or to overdrive J» in the reverse direction. In an array, the output, 
in the form of a voltage pulse, would be sensed across the ends of the 
nickel wire. The solenoid may be replaced by a single copper conductor 
passing at right angles to the nickel wire. For obvious reasons the mem- 
ory cell has been named the “twistor’’. The above method of operation 
will be referred to as mode A. 

Mode B is the use of the magnetic wire as a direct replacement for the 
conventional coincident-current toroid. Its use here differs only in that 
the wire itself is used as a sensing winding (refer to Fig. 5). The pulses 
I, and J, are equal in value and each alone is chosen to be insufficient to 
switch the magnetization state of the wire. The coincidence of J; and I; 
will, however, result in the writing of a bit of information into the wire. 
To read, I, and J, are reversed in polarity and applied coincidently. The 
output appears as a voltage pulse across the ends of the nickel wire. 





L7F LUX PATH 


I, 


Fig. 4 — Coincident currents for the ‘‘write’’ operation in a twistor operated 
mode A. Wire under torsion. 





Fig. 5 — For mode B the coincidence of J; and J. is required to exceed the knee 
of the g-NJ characteristic. Wire under torsion. 
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The third method of operating the twistor, mode C, is similar in 
nature to a method proposed by J. A. Baldwin.’ In this scheme the wire is 
not twisted, so that neither screw sense is favorable. By the proper ap- 
plication of external current pulses, information will be stored in the 
wire in the form of a flux path of a right-hand screw sense for a ‘‘1”’, 
and a left-hand screw sense for a “‘0’’. The operation of the cell is indi- 
cated in Figure 6. Note that the writing procedure requires a coinci- 
dence of currents; the reading procedure does not. The sign of the 
output voltage indicates the stored information. 

Modes A and C are best suited for moderate sized memory arrays 
since the reading procedure is not a coincident type selection. Thus to 
gain access to n° storage points, an access switch capable of selecting 
one of n* points is required. For large arrays the use of mode B is indi- 
cated. It then becomes possible to select one of n* points with a 2n posi- 
tion access switch. The crossover point (about 10” bits) is determined by 
access circuitry considerations. 





SIGNAL OUT i 2 ae OR 


Fig. 6 — Read-write cycle for a twistor operated mode C. The wire is not 
under torsion. 
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IiI. ANALYSIS OF THE SWITCHING PROPERTIES OF THE TWISTOR 


Section 3.1 will deal with the basic properties of magnetic wire as they 
pertain to the twistor memory cell. Section 3.2 will be concerned with a 
composite magnetic wire. The theoretical conclusions will be supported 
by experimental results wherever possible. 


3.1 Solid Magnetic Wire 


It has been stated above that there is a voltage gain inherent in the 
operation of the twistor. This voltage gain makes it possible to obtain 
millivolt signals from wires several mils in diameter. An expression will 
now be derived relating the axial flux of an wntwisted wire to the circular 
flux component of that wire when twisted. Assume that the magnetic 
wire has been twisted so that the flux spirals at an angle 6 (normally 
@ = 45°) with respect to the axis of the wire. If d and / are the diameter 
and length of the magnetized region respectively, then, for a com- 
plete flux reversal the change in the circular flux component is ¢eire = 
l(d/2)(2B, sin 6). 

Here, ¢cire 18 the flux change that would be observed on a hypothetical 
pickup wire which passed down the axis of the magnetic wire. The flux 
change which would be observed by a single pickup loop around the wire, 
if the magnetic wire were not twisted, is Yiongititudinal = 7d B,/2. 
Therefore, Geirc/Yiong = 2 1 sin 0/ad, and for @ = 45°, this expression 
reduces to 


Yeire l/d 
OE see it ee 1 
long? 2.22 (1) 


Thus, for example, if the storage length on a 3-mil wire is 100 mils, 
then a 15:1 gain in flux change (or voltage) is obtained. 


Voss 





V(O) 


(a) (b) 


Fig. 7 — (a) Calculation of the observable voltage Vo»; for a solid magnetic 
wire. (b) Diagram of induced voltage V(r) and resistance R(r). 
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The above derivation assumed that the entire circular flux change 
could be observed externally. Since the magnetic wire must, of necessity, 
serve as the source of the generated voltage the resultant eddy current 
flow reduces the observable flux change by a factor of three. Consider Fig. 
7; assume that the flux reversal takes place in the classical manner and 
consider the circular component of this flux since it alone contributes 
to the observable signal. The induced voltage V(r) at any point r is 
Vir) = V(O)/(o — r/ro), where ro is the radius of the wire and V(0) 
the voltage at r = 0. But V(r), the induced or open-circuit voltage per 
length of wire I, could only exist if the wire were composed of many con- 
centric tubes of wall thickness dr, each insulated from one another. In a 
long wire no radial eddy currents can exist. Therefore the wire of length 1 
can be assumed to be faced by a perfect conductor at both of its ends. 
It remains to calculate the potential between these ends. The resistance 
of the tube is R(r) = pl/2ar dr, where p is the resistivity in ohm-cm. 
The resistance of the wire is given by Ry = pl/mr. These resistances 
form a voltage divider on the znduced voltage in the tube and the total 
contribution of all tubes is obtained by integration; 


a, se pl/ aro” } 
Vonserved = i V(r) ess 


poe @ 


_ V) 
ee oe 





Thus, (1) must be modified ey (2) with the voltage step-up per memory 
cell becoming, 


Vis = Ud 


View 6. 6.66" (3) 





3.11 Bulk Flux Reversal — Classical Case 


The switching performance of a magnetic wire under transient condi- 
tions will now be considered. The speed of magnetization reversal of 
magnetic materials under pulse conditions is best characterized by s, , 
the switching coefficient, usually expressed in oersted-microseconds. It 
is defined as the reciprocal of the slope of the 1/7’, versus H curve where 
7’, is the time required to reverse the magnetization state and H is the 
applied magnetic field intensity. Only eddy current losses will be con- 
sidered. 
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Iirst, consider the case in which the magnetization is entirely circular. 
Reversal from one flux remanent state to the other is assumed to occur 
uniformly in time 7', . Axial eddy currents will flow down the center of 
the wire and return along the surface. The switching coefficient, s, , 
will be obtained by equating the input energy to the dissipated energy. 
The total energy dissipated per unit length is, 


E=7, | " 2nrP(r) dr, | (4) 
and | 
P(r) = a (5) 


where P(r) the power density is given by E(r) the voltage gradient 
squared divided by the resistivity. The average energy per unit volume 
is therefore 


2 
‘ (vo) (* — 4 - el 
Ts To 3 9 
So ie aN ee Oe a 
Tro" Jo pl? (6) 
Le 42) 
18 \ 1 /- 
Now V(0) = [(dB/dt)rol]10~°, so V(0)/l = (2Byr/T')10 *. Putting this 
expression into (6) yields 





Eav/em3 _ 





2.-°2 
Saver/em? = SR 107°. (7) 





The input energy per unit volume is 


—4an-7 
ae Snir (8) 


since AB-H = 2B,H cos 6, where 6, is the angle between the applied 
field H and the switching flux. The factor 10 ‘/47 is a constant relating 
the energy in joules to the BH product in gauss-oersteds. By equating 
(7) and (8), and replacing H by (H — #H)), the desired s,, expression is 
obtained; 


‘(4nBare2) 107 


fe = TE ea Ty) Qo cos 6 


(oe-usec). (9) 


The substitution of H — HAH) for H requires some explanation. The 
switching curve of 1/7, versus H is not a straight line as would be pre- 
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dicted from (9) but generally possesses considerable curvature at low 
drives. Equation (9) satisfactorily predicts the slope of the switching 
curve in the high drive region, but Hy» must be determined experimen- 
tally. In Section 3.12, flux reversal by wall motion is treated as it is a 
possible switching mechanism at low drives. 

The switching coefficient s, for the case in which the magnetization is 
purely axial will now be treated. As above, the flux density will change 
from —B, to +B, uniformly in time 7',. The eddy currents, which are 
circular, result from an induced voltage V(r) where V(r) = [V (ro) (1/ro)'], 
and V(r) is given by V(ro) = [(2B./T;)a70 10 . Thus, E(r) = V(r)/2zrr, 
and H(r) = (B.r/T 310°. Following the procedure used above, the in- 
ternally dissipated energy es is 


(Ba lO (2) 





Sav/em3 — e Qr r dr, 
us p 
re (10) 
_ Bere 16 
Equating this expression to (8) yields 
2 —3 
(f=) t= ia (11) 
p COS 8 


where 6: is defined as the angle between the applied field and the switch- 
ing flux now assumed axial. 

The helical flux vector in a twistor can be resolved into a circular and 
an axial component. Fortunately, since the dissipated energy is pro- 
portional to the eddy current density squared, and the axial and cir- 
cular current density vectors are perpendicular to each other, it is 
possible to write 


Gav/ems(helical) = 6.y/em3(axial) + Say/em3(cireular). (12) 
It follows, for a 45 degree pitch angle, that 


S»(axial) -+ s,,(circular) 


Sw(helical) = 5 


(13) 
where the factor “2” is a consequence of the flux density components 
being smaller by 1/+/2 than their resultant. Substitution of (9) and (11) 
into (18) gives the desired switching coefficient 


134 B.ro 10” 


- (14 
18 pcos é “ 


Sy (helical) = (H — Hy)T; = 
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The term cos @ requires further explanation. The magnetization vector 
is constrained by energy considerations to align with the easy direction 
of magnetization. The angle between the applied field and the easy 
direction of magnetization is called 6. Equation (14) is valid for any 
direction of applied field. The angles 6; and 42 used in deriving (9) and 
(11) respectively are each 45 degrees for the helical pitch angle assumed 
above. 

Equation (14) indicates that for maximum switching speed a material 
with low saturation flux density and high resistivity is required. The 
lower limit on s, will be determined by internal loss mechanisms not 
treated here. Experimentally, this lower bound is found to be approxi- 
mately 0.2 oe-psec. 


3.12 Reversal by Single Wall 


The switching time of a twistor when operating in a memory array 
under coincident current conditions will depend upon the low-drive 
switching coefficient. Experimentally, it is observed that the low drive 
Sy 1s several times the high-drive value. In this section, following the 
method of Williams, Shockley, and Kittel,’ flux reversal by the move- 
ment of a single wall will be treated. Only the circular flux case will be 
considered. | 

The technique used to obtain s, is identical to that used in Section 
3.1.1 except it is postulated that a single wall concentric to the wire 
moves either from the wire surface inward, or from the wire axis outward. 
The result is independent of the direction in which the wall moves. As- 
sume the wall moves from r = 0 tor = ™, as indicated in Fig. 8. In- 


ar V(0) 


6 vir) 





Fig. 8 — Flux reversal by expanding wall instantaneously located at radial 
position aro moving with velocity v. 
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stantaneously, the wall is located at aro and is traveling with a velocity v. 
The induced voltage V..(r) 1s, 3 
Voelr) = (2B,lv)10 ° 0 a gan arg , 
(15) 
= 0 an<r<n, 


and the observable voltage for a wire of length J is given by 


Vic = aes cy) Pilate 
© 0 ©. pl/2Qerrdr 





= i Voe(r) = dr 
0 To 


a Voc(r). 


It is clear in the above integration that V,.(7r) must be treated as a con- 
stant. Using expressions (4) and (5) 


SBavent = [Fe a) Dera 


ot TT" Pp 
8 3) (0 — a2)2ardr, (16) 
Sep arg 


O8av/em3 = (7) (1 — 4 Va" 
ot l p 


The rate of applying energy is 








ds OB (H — H,) cos @\ ,.-7 
a (5 en ae os 6 ) 1 | (17) 
Once again hysteresis losses are not included. Since dB/dt = 2Bunm, 


and V,. is given by (15), the equation of (16) and (17) yields, 


o(H — HF.) cos 6 


a ee 
a) a0 Sr Bary 


Since v = (dr/dt) = 7% (da/dt), 


ae _ AH — FH.) cde ALS 
i i a —@) ag Sr Bro" a; 


3 5 
a a _ p(H — H,) cos 8 
3 5 (8rB,re)10~° a 
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When a equals 0, ¢ equals 0, so the constant of integration is zero. When 
a equals 1, ¢ equals 7’, , so 


16m Bar 
15 pcos @ 





s, = T,(H1 — HH, = ( ) 10° °(oe-usec) (18) 
Comparison to the corresponding bulk flux reversal case indicates that 
the wall motion mechanism is more lossy by a factor of 2.4. 


3.2 The Composite Magnetic Wire 


It is apparent from the switching data of Fig. 9 that for reasonably 
sized solid wires (7) > 1 mil) the switching coefficient s., is unreasonably 
high. The typical ferrite memory toroid, for cxample, when used as a 
memory element has an s, of 0.6 oersted-microseconds. The only possi- 
bility for high speed coincident-current operation for solid magnetic wires 
is that the material have a high coercive force H, , a conclusion not con- 
sistent with the trend toward transistor driven memory systems. 

By the use of a composite wire it should be possible to reduce the eddy 
current losses and still preserve a reasonable wire diameter. A composite 
wire, by definition, will consist of a non-magnetic inner wire clad with a 
magnetic skin. It may be fabricated by a plating or an extruding process. 

The solid wire analysis of Section 3.1 is a special case of composite 


6 


4-79 PERMALLOY;IMIL UNANNEALED 
83 Ni,17Fe, 0.5MNn;2MIL UNANNEALED 
NICKEL; 3MIL ANNEALED 
5 ; NICKEL ON NICHROME;3 
; 3MIL UNANNEALED,@ = 0.70 
NICKEL ON COPPER; 
3MIL ANNEALED, = 0.75 


1/Ts IN {£SECH! 
G) 





0 : 10 15 20 25 30 35 40 
HaxiaL IN OERSTEDS 


Fig. 9 — Reciprocal of flux reversal time 7's as a function of applied axial drive, 
H, for solid and composite magnetic wires. Sufficient torsion applied to reach 
saturation. 
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wire analysis which is given in Appendix I. Only the results of the com- 
posite wire case will be given here. As indicated in Fig. 10, pi and pe 





(a) (b) 


Fig. 10 — (a) Composite wire is composed of non-magnetic core covered by a 
magnetic skin. (b) A voltage V(r) is induced in the wire during flux reversal. 


are the resistivities of the inner (non-magnetic) and outer (magnetic) 
materials. The inner material is contained within a radius 7; . The over- 
all wire radius is re. Defining a = 1,/r2, if Vong is the voltage observed 
across the ends of the composite wire twistor memory cell, and V(0) 
is the induced voltage at r = 0 for a solid magnetic wire of radius 12 , 


Vobs = bV (0), and 
Bibs + ca) +a -_ a 
h = 2 3 3 


(i = a?) +. a? 
p2 


(19) 


The parameter “b” reduces to 3 for a = 0 in agreement with (2) which 
was derived for the solid wire case. Table I gives ‘‘b” for various material 
and geometry combinations. 


TABLE | — THE PARAMETER ‘‘D” 


a 


pi/p2 
0.0 
0.9 0.8 0.7 (Solid Wire) 
0 0.100 0.200 0.300 1.000 
vy 0.0988 0.194 0.285 333 
3 0.0963 0.184 0.259 333 
1 0.0903 0.163 0.219 333 
3 0.0790 0.135 0.180 333 
~ 10 0.0643 0.112 0.155 333 
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TaBLE IJ — THe PARAMETER “‘Ceirc’’ 


a 


pi/p2 aa 
0.9 0.8 0.7 (Solid Wire) 
0 0.0858 0.3538 0.820 1.396 
a 0.0847 0.339 0.763 1.396 
3 0.0842 0.311 0.657 1.396 
1 0.0737 0.256 0.498 1.396 
3 0.0595 0.159 0.341 1.396 
10 0.0286 0.124 0.2438 1.396 
0 0.0209 0.0833 0.186 1.396 


The switching coefficient, s, , for circular flux reversal, is derived as 


(8rB,r2)10~° 


_ BBO" fl yg — 3) — 4 — 8) 
ye cag COS f I a — b) FS (1 — 0) me 
ob 6| aye 40 — b) , 1 
a Sg + (1 — b) ae aoe a5 At 
or | 
2 —3 
Sp Cees ae (oe-usec). (21) 
p2 COS @ 
Table II gives Cire as a function of a and p;/pe. 
The switching coefficient s,, for axial flux reversal is derived as 
24n—3 2 4 
oie (238 10 \C — 4a -- a (3 4 In ?) (22) 
pz COS 8 1 — @ 
2 —3 
= Cesta ee aa) (23) 
pe COs 6 / 


Table III gives Caxiai as a function of ‘a’. 
Since, as explained for the solid wire case, the eddy current density 
vectors for circular and axial flux reversal are in quadrature, 


S»(axial) ++ s,,(eircular) 


S»(helical) = 5 


(24) 


Substitution of (21) and (23) into (24) gives the required expression; 


Ceire + ee) (Bers’) 107° 


Sw(helical) = ( 9 p2 COS 0 


(25) 


A number of composite wire samples have been prepared and evalu- 
ated. These include nickel on nichrome and nickel on copper. The switch- 
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TaBLE ITI — THe PARAMETER “Coaxial” 


0.0 
(Solid Wire) 


nT | | 


ing curves for these samples as well as for a number of solid magnetic 
wires are shown in Fig. 9. The agreement between the measured values 
of s, and the calculated values [(14) and (25)] is quite good. Improved 
composite wire samples are under development. 


IV. EXPERIMENTAL MEMORY CELLS AND ARRAYS 


The initial experiments were performed using commercially available 
nickel wire of 3-mil diameter. The g-NI characteristic of this wire in the 
helical direction is extremely square. This is a feature of all the magnetic 
materials tested whether annealed or unannealed. As a typical example, 
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Fig. 11 — Sixty cycle and switching waveforms for 83 Ni, 17 Fe wire (see Fig. 9) 
as a function of applied torsion. 
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the 60-cycle characteristics of the axial and circular flux versus axial 
drive and the switching voltage waveforms under pulse conditions are 
given in Fig. 11 for 2-mil wire of composition 83 Ni, 17 Fe. 

Note the negative prespike on the switching waveforms. By simultane- 
ous observation of both the axial and circular switching voltage wave- 
forms on many different magnetic wires it has been concluded that the 
negative prespike is due to an initial coherent rotation of the magnetiza- 
tion vector which results in an initial increase in the circular flux com- 
ponent. It is during this coherent rotation that the normal positive pre- 
spike on the axial switching voltage waveform is observed. Because of 
the mechanically introduced strain anisotropy, however, the magnetiza- 
tion vector is constrained to remain nearly parallel to the easy direction 
of magnetization. Thus, the coherent rotation soon ceases and the re- 
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Fig. 13 — A 320-bit experimental twistor memory array. The array is transistor 
driven. 


mainder of the flux reversal process is by an incoherent rotational process. 
During this latter time the circular and axial voltage waveforms are 
virtually identical. 

Fig. 12 gives the range of operation of 2-mil 83 Ni, 17 Fe wire as a 
twistor operated by mode A. As a result of the extreme squareness of 
the g-NI characteristic in helical direction the range of operation en- 
closes an area nearly the theoretical maximum. The switching times of 
other memory cells tested ranged from 0.2 usec for a 1 mil 4-79 moly- 
permalloy wire to 20 usec for a 5 mil perminvar wire. Thus it is seen 
that the switching speeds of the twistor compare quite favorably with 
those of conventional ferrite toroids and sheets. 

It is, of course, possible to store many bits of information along a single 
magnetic wire. The allowable number of bits per inch is related to the 
coercive force, the saturation flux density, and the diameter of the wire. 
For the nickel wire, about 10 bits per inch are possible. Predictions as 
to the storage density for a given material can be made by referring to 
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suitable demagnetization data. There are, however, interference effects 
between cells which are not completely understood at the present time. 

A memory array (16 X 20) has been constructed as a test vehicle. An 
illustration of this array is shown in Fig. 13. The drive wires have been 
woven over glass tubes which house the removable magnetic wires. 
Provision is made for varying the torsion and the tension of the in- 
dividual magnetic wires. 

As an indication of the performance of the twistor, Fig. 14 is a com- 
posite photograph showing the minimum and maximum signal over the 
16 bits of a given column for 3-mil nickel wire. Also included are the 
noise pulses for these cells, the so-called disturbed zero signals. The 
write currents were 2.3 ampere-turns on the solenoid and 130 ma through 
the magnetic wire. The read current was 6.0 ampere-turns. The array 





Fig. 14 — Composite photograph of the 16 output signals from a column of the 
array of Fig. 13. Average output signal about 3.5 millivolts; sweep speed equals 
2 wsec/cm. 


was transistor driven. A read-write cycle time of 10 microseconds ap- 
peared to be possible. 


V. DISCUSSION 


The twistor is presented as a logical companion to the coincident- 
current ferrite core and sheet.” ‘ In many applications it should compete 
directly with its ferrite equivalents. Perhaps its greatest use will be 
found in very large (>10°) memory arrays. 

From a cost per bit viewpoint the future of the twistor appears quite 
promising. Fabricating and testing the wire should present no special 
problems as it is especially suited for rapid, automatic handling. The 
possibility of applying weaving techniques to the construction of a 
twistor matrix looks promising. 

It is possible that, for both mode A and C operation of the twistor, an 
array can be built which consists simply of horizontal copper wires and 
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vertical magnetic wires — much like a window screen. Preliminary ex- 
periments have shown that single cross wires do operate successfully. 
The operation of this array would be analogous to a core memory ar- 
ray. Physically it could look just like a core array — but without 
the cores. 
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APPENDIX I 


From Fig. 10, for bulk circular flux reversal in a composite wire, the 
induced voltage V(r) for a wire length 1 is 








V(r) = (rs — nl (7? ‘| 107, O<r<n, 
. (26) 
=| (s — r)l & ‘| 10°, m™<r<fe. 
For a solid magnetic wire of radius r2 
vo) = (7B) 10 7) 
Therefore, 
V(r) = (25) V(O), O<r<n, 
2 
(28) 
V(r) = (ae _— ") V(0), mr <tr. 
2 


In general, the resistance of a tube of wall thickness dr is R(r) = pl/ 
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2ardr. The resistance of the wire is Ry = pipol/rloi(re’ — ri) + por’). 
The observable voltage for a length of wire, I, is 


— | WA  CveleDivider) 


pipel 
= =[ = : V(0) mipi(r? — 11°) + pers’ 
pil 
2rr dr 


pipal 
: (" (2=*) V (0) ete OT Bel 
TI 2 2 
2rr dr 





This reduces to 


(4 —a + 3a)pi/pe +a —a 
(1 — a?)pi/p2 + a? 


= bV(0), (30) 


where a = 7,/r2 and 0b is given by reference to (29). The ratio b/3 = 3b 
is the relative efficiency of the composite as compared to the solid 
magnetic wire from an available signal viewpoint. An expression for s, 
will now be derived. 

The total energy dissipated per unit length / 1s 


Vows = V(O) (29) 


Er/l = i) pia (r)2Qxr dr, (31) 
0 


where 7,(7) is the current density. Now, ta(r) = V(r) — Vous/pl, therefore 





CO eed a! me 6.28 2%: 
(32) 
=(1-"- 70) mn<r<tfe. 


The substitution of (82) into (81) yields, after manipulations, 
2 2 
Env/l oo meee. ee) a la 2 rs = a (1 = b)° 


p2 l 


(33) 
ne ee eee 


3) 2 
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T'rom (8), the applied energy per wire length lis 


E/l = (ease) me en: (34) 


where only that part of the applied energy associated with the high 
drive dynamic losses is included. HKquating (83) and (84) ane replacing 
V(0)/l by (27) results in 


24n73 
= (H — H))Ts = el ea {a C -a—by® 
~~ pol — a2) cos 6 | p1 (35) 
35 
4a(1 — b) 4 » 42 —0b) , 1h 
oe ee (2) = ee 
— (1 — bd)? + 3 5 sa ( b) 3 ag af 
This can be expressed as 
2 =o 
= (H aa Hy) ye — C eee (Bers) 10” (oe-usec). (36) 


p2 COS @ 


Bulk axial flux reversal in a composite magnetic wire can be treated in 
a manner analogous to that used in Section 3.1.1 for the solid wire. The 
uniform reversal of the axial flux induces a voltage V(r) in the wire where 


r\° n\? 
V(r) = Vr) (") — @ i} eee ae Ae Oe 
T9 To 
= Q), 0 <r< PL 
and V(r2) = [(2B,/T,)arz 10. Since E(r) = V(r)/2zr, 
2 
E(r) = Bs (- — io (37) 
iv r 


Following the procedure of Section 3.1.1., 


—16 r9 2 2) 2 
Besse _Fs10 Bs" (- _ ) rer 
- 


a (re = 11”) ry p2l's? 


: 38) 
_ a. E —a+a(2—In 4 ( 
pol's 1-— a? : 
where a = 11/r2 as before. Equating this expression to (8) yields 
Bsr’10° [1 — 40° + a3 — 4Ina) 
iC Se I he ea jie teG— Aine) 
( | T's p2 COS 8 1 — @ (39) 


24n—3 
0 a eas (oe-ysec). (40) 
pz COS 6 
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Non-Binary Error Correction Codes* 


By WERNER ULRICH 
(Manuscript received April 19, 1957) 


If a noisy channel is used to transmit more than two distinct signals, 
information may have to be specially coded to permit occasional errors to be 
corrected. If pulse amplitude modulation is used, the most probable error 
7s a small one, e.g., 6 1s changed to 7 or 5. Codes for correcting single small 
errors, and for correcting single small errors and detecting double small 
errors, in a message of arbitrary length, for an arbitrary number of duffer- 
ent stgnals in the channel, are derived in this paper. 

For more specialized situations, the error 1s not necessarily restricted to a 
small value. Codes are derived for correcting any single unrestricted error 
in a message of arbitrary length for an arbitrary number of different sig- 
nals. 

Finally, a set of codes based partially upon the Reed-Muller codes is 
described for correcting a number of errors in a more restricted class of 
message lengths for an arbitrary number of different signals. 

The described codes are readily tmplemented. Many techniques are used 
which have an analog in a binary system. Other techniques are broadly 
analogous to binary coding techniques or are special adaptations of a 
binary code. 


I. INTRODUCTION 
1.1 Use of Error Correction Codes 


One function of an error correction code is to aid in the correct trans- 
mission of digital information over a noisy channel. This process is 
illustrated in Fig. 1. An information source gives information to an 
encoder; the encoder converts the information into a message containing 
sufficient redundancy to permit the message to be slightly mutilated by 
the noisy channel and still be correctly interpreted at the destination. 
The message is then sent via the noisy channel to a decoder which will 
 * This paper was submitted to Columbia University in partial fulfillment of 
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of Engineering. 
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reconstruct the original information if the mutilation has not been ex- 
cessive. Finally, the information is sent to an information receptor. 

One scheme for correcting errors in a binary system is to send each 
binary digit of information three times and to accept at the receiver 
that value which is represented by two or three of the received digits. 
Then, the encoder is simply an instrument for causing each digit to be 
sent three times, and the decoder consists of a majority organ. However, 
many methods are available which are considerably more elegant, and 
which will permit more information to be passed through a noisy channel 
in a given unit of time. This paper will deal with such methods for 
channels capable of sending b different symbols instead of the usual 1 and 
O of a binary channel. 

The most convenient explanation of an error correction code has been 
made with respect to the transmission of correct digital information 
over a noisy channel. This does not imply the restriction of such codes 


INFORMATION oon 
INREMEEON ENCODER | CHANNEL =F (CORRECTOR) RECEP 


voll 


Fig. 1 — Transmission over a noisy channel. 


to the noisy channel problem exclusively. Actually, the first application 
considered for such a code was with respect to computers.! Many large 
high speed computers stop whenever an error is detected in some calcu- 
lation and must be restarted; with the use of an error correction code 
this could be avoided by permitting the computer to correct its own 
random errors directly. To the best knowledge of the author, error 
correction codes have not yet been used in any major computer. But 
the storage system of a computer may, in the future, lend itself to the 
use of error correction codes. 

Frequently, very elaborate precautions must be taken in present 
storage systems to insure that they are free from errors. Magnetic tapes 
must be specially made and handled to guarantee the absence of defects, 
magnetic cores must be carefully tested to make sure that no defective 
cores get into an array, cathode ray tubes used in Willams Tube or 
Barrier Grid Tube storage systems must be perfect. Probably, there are 
other storage methods whose development is hampered because of a 
common requirement for error-free performance in all storage locations. 
With the use of error correction codes, such storage systems could be 
used, if they are sufficiently close to perfection, even though not perfect. 
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It is not unlikely that the near future will see the development. of 
storage systems which will be able to store more than tio states at every 
basic storage location.? If such systems are developed, it seems likely 
that they will be more erratic or noisy than binary storage systems, 
since each location must store one of b signals instead of one of two. If a 
cathode ray tube storage system were used, for example, different quan- 
tities of charge would have to be distinguished; in a binary storage 
system, only the presence or absence of charge must be detected. This 
suggests that error correction codes may become essential with certain 
types of non-binary storage systems. One object of this paper is to 
develop codes for this purpose and to discover which number systems 
are most easily correctable. 

Some investigations have been made on the use of computer systems 
using multi-state elements.? A switching algebra has been developed 
similar to Boolean algebra for handling switching problems in terms of 
multi-state elements. Single device ring counters (the cold cathode gas 
stepping tube for example) already exist and might be useful in such 
systems. But currently, only limited steps in this direction have been 
made. Another object of this paper is to show the advantages and 
problems of error correction codes in multi-state systems; it is not un- 
reasonable to predict that error correction codes may be more necessary 
in multi-state systems than in binary systems. 


1.2 Geometric Concept of Error Correction Codes 


A geometric model of a code was suggested by R. W. Hamming! 
which can be altered slightly to fit the non-binary case. For an n digit 
message, a particular message is a point in » dimensional space. A 
single error, however defined, will change the message, and will cor- 
respond to another point in n dimensional space. The distance between 
the original point and the new point is considered to be unity. Thus, 
the distance d between the points corresponding to any two messages is 
defined as the minimum number of errors which can convert the first 
message into the second. 

With an error detection and/or correction code, the set of transmitted 
messages is limited so that those which are correctly received are recog- 
nizable; those messages which are received with fewer than a given 
number of errors are either corrected or the fact that they are wrong is 
recognized and some other appropriate action (such as stopping a com- 
puter) is taken. | 

In the case of binary codes, an error changes a 1 to 0 or a 0 to 1. In 
the non-binary case, two definitions of an error are possible and will be 
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used in this paper. A small error changes a digit to an adjacent value. 
In a decimal system, a change from 1 to 2 or 1 to 0 is a small error. An 
unrestricted error changes a digit to any other value. In a decimal sys- 
tem, a change from 1 to 5 is an unrestricted error. 


1.3 Material To Be Presented 


The various types of codes described in this paper and the sections 
in which they are to be found are summarized in Table I. The tech- 
niques which are described are summarized below. 

The geometric model suggests the simplest approach to error correction 
codes. A transmitter has a ‘‘codebook”’ containing all members of the 
set of transmitted messages. If the message source gives to the encoder 
the signal that the information to be sent is & (that is to say, the kth 


TABLE I — TyrEs or CopEs 





Type of Code Distance Type of Error Pee in 

Single Error Detection 2 Small and Unrestricted II 
Single Error Correction 3 Small IIT and 6.1 
Single Irror Correction 3 

Prime Number Base Unrestricted 4.1 

Composite Number Base Unrestricted 4.2 
Single Error Correction 4 Small V and 6.1 

and 

Double Error Detection 
Multiple Error Correction — Small 6.2 


output of all the outputs associated with the message source), the en- 
coder chooses the kth member of the set. The decoder will then look up 
the message it receives in its own codebook which contains all possible 
recelved messages, and corresponding to the entry of the received mes- 
sage will find the symbols corresponding to k. Or the receiver may 
compare the received message with every member of the set of trans- 
mitted messages, calculate the distance between the two, and correct 
the received message to whichever of the transmitted messages 1s sep- 
arated from the received message by the smallest distance. (It has been 
shown by Slepian‘ that this is the message most likely to be correct in a 
symmetrical binary channel having the property that changes from 1 to 
0 and from 0 to 1 as a result of noise in the channel are equally likely.) 

The practical difficulty with such a code is the large size of the re- 
quired codebooks. Most coding schemes try to eliminate such codebooks 
and substitute a set of rules for encoding, decoding and correcting 
messages. 
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One approach toward creating a simple association between the infor- 
mation and the message is to use some of the digits of the message for 
conveying information directly. The Hamming Code! uses this tech- 
nique. | 

An information digit is a digit of a message that 1s produced directly 
by the information source; in a base b code, an information digit may 
have b different values, the choice between these values representing the 
information that is to be sent. 

A check digit is a digit of a message that is calculated as a function of 
the information digits by the encoder. It is sometimes convenient to 
represent or calculate a check digit in terms of a recursive formula using 
previously calculated check digits as well as information digits. In a 
base b code, a check digit may have b check states. When more than one 
check digit is used, each different combination of check digits corre- 
sponds to a different check state for the message; a message with m 
check digits will have b” message check states. 

A systematic® code encoder generates messages containing only infor- 
mation digits and check digits. The information source generates only 
base b information digits. The Hamming Code is a systematic code. 

Section IT offers a general method for obtaining single error detection 
codes for both small and unrestricted errors. The idea of mixed digits 
(digits which are, in a sense, neither information nor check digits, but a 
combination of both) is introduced, and it is shown how mixed digits 
may lead to more efficient coding systems. This idea is believed to. be 
novel. Code systems which use mixed digits are called semi-systematic 
codes. Semi-systematic codes are used extensively throughout this 
paper. 

Section III offers a general method for obtaining single small error — 
correction codes, including both systematic and semi-systematic codes. 

Section IV offers a general method for obtaining the more complicated 
single unrestricted error correction codes. The problem is divided into 
two parts. Section 4.1 describes codes for correcting single unrestricted 
errors in case 0b, the base of the channel, is a prime number.* Section 4.2 
describes a special technique for obtaining the more complex codes for 
correcting single unrestricted errors in the event b is a composite num- 
ber. 

Section V offers a general method for obtaining semi-systematic codes 
for correcting single small errors and detecting double small errors. No 
general solution has been found for obtaining single error correction or 
double error detection codes for the case of unrestricted errors. No gen- 


* This class of codes was previously described in a brief summary by Golay.¢ 
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eral solution has been found for multiple error correction codes for the 
unrestricted error case. 

In Section VI, a number of techniques are presented for using binary 
error correction coding schemes for non-binary error correction codes. 
Section 6.1 shows how such techniques may be used to obtain non-binaryv 
single error correction codes, and single error correction double error 
detection codes, for the small error case. Section 6.2 presents a special 
technique, involving the use of an adaptation of the Reed-Muller binary 
code, to obtain a class of non-binary multiple error correction codes, for 
the small error case. 

Section VII shows that an iterative technique of binary coding can be 
directly applied to non-binary codes. It also shows how an adapted 
Reed-Muller code can be profitably used in such a system. 

Section VIII summarizes the results obtained in Sections II-VII and 
shows the advantages and shortcomings of many of these codes. 

Section IX presents general conclusions which may be drawn from 
this paper. 


II. SINGLE ERROR DETECTION CODES 


Single error detection codes require message points separated in n 
dimensional space by a distance of two. 

Yor the binary case, the only two possible types of errors are the 
change from a 1 to a 0 and froma Otoal. 

A simple technique that is used frequently for binary error detection 
codes is to encode all messages in such a manner that every message 
contains an even number of 1’s. This 1s accomplished by adding a parity 
check digit to the information digits of a message; this digit is a I if an 
odd number of 1’s exist in the information digits of a message and is a 
0 if an even number of 1’s exist in the information digits. At least two 
errors must occur before a message containing an even number of 1’s 
can be converted into another message containing an even number of 
1’s, since the first error will always cause an odd number of 1’s to 
appear. A message with an odd number of 1’s is known to be incorrect.* 

An analogous technique may be used for the unrestricted error case in 
non-binary codes. We can obtain a satisfactory code by adding a com- 
plementing digit to a series of information digits to form a message. 

A complementing digit, base b, is defined as a digit which when added 
to some other digit will yield a multiple of b. 

* Parity check digits may be selected to make the number of 1’s in a message 


always odd, but the principle is the same; in this case, an error is recognized if a 
received message contains an even number of 1’s. 


NON-BINARY ERROR CORRECTION CODES 1347 


Tor a single unrestricted error detection code, the complementing 
digit complements the sum of the information digits. A complementing 
digit is a check digit. In the binary case, it is a parity check digit. 

As an example, consider a decimal code of this type. A message 823 
would require a complementing digit 7, making the total message 
8237 (8 +2+3+ 7 = 20, a multiple of 10). An error in any one digit 
will mean that the sum of the message digits will not be a multiple of 10. 

For the small error case, it is sufficient to make certain that the sum 
of all digits is even since any error of +1 would destroy this property. 
For the binary case, all errors are small since the only possible error on 
any digit is a change by -++1; a simple parity check is adequate. For a 
non-binary code, it would be wasteful to add a digit just to make sure 
that the sum of all digits is even. In a decimal code for example, if the 
sum of the message digits is even, the values 0, 2, 4, 6, 8 for the cheek 
digit will satisfy a check, or if the sum of the message digits is odd, the 
values 1, 3, 5, 7, 9 will satisfy the check. More information could be 
sent if a choice among these values could be associated with informa- 
tion generated by the information source. 

This introduces the concept of a mixed digit; i.e., a digit which conveys 
both check information and message information. 

A mixed digit is defined as follows: a mixed digit 2, base b, is composed 
of two components (y, 2) where y represents an information component 
and z represents a check component. The number of information states 
of a mixed digit is 8, with y taking the values 0, 1, ---, 8 — 1; the 
number of check states of a mixed digit is a, the number base of z. 
In a message containing m check digits and h mixed digits, the number 
of check states for the message is b”™-a1-:a2:...-a,, Where a; is the 
number of check states of the 2’th mixed digit. 

If mixed digits are used as part of a code, information must be avail- 
able in at least two number bases; b, the number base of the channel, 
and 8, the number base of the mixed digit. A situation where this arises 
naturally is in the case of the algebraic sign of a number; this is a digit 
of information, base 2, which may be associated with other digits of 
any base. Similarly, any identification which must be associated with 
numerical information can be conveniently coded in a number base 
different from the number base of the numerical information. Thus, a 
mixed digit can sometimes be used conveniently in an information trans- 
mission system without complicating the information source and re- 
ceptor. 

An error detection code for single small errors suggests the use of a 
mixed digit. In the decimal code for example, the quibinary’ representa- 


1348 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


TABLE II — QuriBINARY CODE 


Quinary Component Binary Component Decimal Digit 
0 0 0 
0 1 ~] 
| 0 py 
1 ] 3 
2 0 4 
2 1 5 
3 0 6 
3 1 7 
4 () 8 
4 I 9 


/ 


tion of the mixed digit might be used, letting the quinary component of 
the mixed digit convey information and the binary component a check. 
(Table IT.) 

The information source generates blocks of decimal digits followed by 
one quinary digit. The messages are then generated in the following 
way: record all decimal information digits as information message digits 
and take their sum; if the sum is even, the binary component, z, of the 
mixed digit is 0, otherwise it is 1. The quinary component, y, of the mixed 
digit is taken directly from the information source and combined with 
the calculated binary part by the rules of the quibinary code to form 
the mixed decimal digit. Thus, x, the value of the mixed digit, is given 
by the formula: 


ge = Qy + z. (1) 


Tor example, if the decimal digits of a message are 289 and the quinary 
digit of the message is 3, the mixed digit is 7, and the message Js 2897. 
The sum of the decimal information digits is 19, which is odd, so that 
the binary component of the mixed digit is 1; this is combined with the 
quinary component, 3, by the rules of the quibinary code table, to form 
decimal digit 7. The requirement that the sum of all digits be even is 
satisfied by the binary component of the mixed digit, and the informa- 
tion associated with the mixed digit is contained in the quinary com- 
ponent. 

This method is easily extensible to any other number base and is also 
extensible to the case of slightly larger but still restricted errors (such 
as +1 or +2), provided that the maximum single error is less than 
(b — 1)/2. 

Irom the preceding example, it is apparent that mixed digits can be 
usefully employed in error detection codes. The use of mixed, check and 
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information digits simplified the encoder and decoder. To differentiate 
among the classes of codes which will be described in this paper, the 
following terms will be used, in addition to those previously defined. 

A semi-systematic code encoder produces messages containing only 
information, mixed and check digits. The information source generates 
information digits in base 6 for information digits, and in base @ for 
mixed digits. (The example given above is a semi-systematic code.) 

Of two coding schemes in the same channel base b, each working with 
messages of the same length, and each satisfying a given error detection 
or correction criterion, the more efficient scheme is defined as the one 
which produces the larger number of different possible messages. 


III. SINGLE ERROR CORRECTION CODES, SMALL ERRORS (£1) 


The problems of error correction codes in nonbinary systems are ex- 
tensive and must be treated in several distinct sections. The basic differ- 
ence between the error correction problem in binary and non-binary 
codes is the fact that the sign of the error is important. In a binary 
code, if the message 11 is received and it is known that the second digit 
is incorrect, only one correction can be made, to 10. But in a decimal 
code with errors limited to +1, if the message 12 is received and it is 
known that the second digit 1s wrong, it can be changed to either 11 or 18. 

Consider the following simple code for correcting single small errors. 
A decimal channel is used, and a message is composed of three informa- 
tion digits and one check digit. Let x; represent the check digit and 22 , 
x3, 24 the information digits. Here, x 1s chosen to satisfy* 


21 + 2x2 + 823 + 4a, = 0 mod 10. (2) 


The encoder calculates x, , and transmits the message 2%2232,. This is 
received as 2;/%2'x3'x4'. The decoder then calculates ¢ given by 


C= ON + 2x! + 323" + Aay’) mod 10. (3) 


If the assumption is made that at most a single small error exists, then 
this error can be corrected by using the following rules, which may be 
verified by Inspection. 

If c = 0, no correction is necessary ; 

5 > ¢ > 0, decrease the cth digit by one; 

* By definition a = c mod b is equivalent to a = c + nb, where a, b, c and n 
are integers. The equality notation is used in preference to the congruence nota- 
tion throughout this paper, since an addition performed without carry occurs 
naturally in many circuits; in terms of such a circuit, the mod 6 signifies only the 


base of the addition, and a true equality exists between the state of two circuits, 
with the same output even though one has been cycled more often. 
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5 < c, increase the (10 — c)th digit by one; 
= 5 implies a multiple error or a larger error. 

Since the value of ¢ is used for correcting a received message, it is 
called the corrector.* For the general case, a corrector is defined as 
follows. . 

In a message encoded to satisfy m separate checks, the result of cal- 
culating the checks for the received message at the decoder is an m digit 
word called the corrector. There are as many possible values of the cor- 
rector as there are check states of the message, although all of the 
values of the corrector need not correspond to a correctable error. 

It is important that, for a given transmitted message, every different 
error will lead to a different value of the corrector; otherwise there will 
be no way of knowing which correction corresponds to a particular value 
of the corrector. The number of correctable errors may be far less than 
the number of possible values of the corrector, so that not all of these 
values may be useful for a code to correct a particular class of errors. 
However, the number of corrector states sets an upper limit to the num- 
ber of possible corrections. 

Tor many codes, it 1s convenient to associate a particular value of a 
corrector for the condition that a particular digit has been received too 
high by a single increment, for example, a 7 received as an 8. 

The characteristic of a digit for a particular code is defined as the value 
of the corrector if that digit is incorrectly received, the error having 
increased the value of the digit by +1, and all other digits are correctly 
received. Obviously, this definition only applies to those codes having the 
property that the value of the corrector 1s independent of the value of the 
incorrect digit and of the other digits. 

A simple characteristic code encoder produces messages in which each 
digit has a distinct characteristic as defined above. 

The Hamming code is an example of a simple characteristic code as 
is the code previously described. In that example, the characteristic of x; 
is 2. 

The advantage of a simple characteristic code for single small error 
correction is obvious: the association between the calculated checks 
and the correction to be performed is simple and does not depend on the 
values of the digits of the message. 

The following example of a simple characteristic code will illustrate 
this principle more fully. 

Consider a single small error correction code, working with a quinary 


* The terms corrector and characteristic were first used in a more restricted 
sense 1n an article on binary coding by Golay.® 
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(base 5) channel. Each message will consist of ten information digits 
and two check digits. 

Let x; and x2 represent the check digits, and x3, x4, °°* , V12 represent 
the information digits. 

The equations for calculating x and 22 are: 


la, + Ox, + Ox3 + lag + las + lave + lez 

+ 2x + Qa + 2a + 2a + 2a = 0 mod 5, 
Oa, + lave + 273 + lay + 245 + 382x—6 + 427 

+ Oag + lag + 2ay9 + 341 + 42412 = 0 mod 5. 


(4) 


(5) 
At the decoder, the corrector terms, c,; and cz, are calculated using 7,’, 
the received value of x; , in the following formulas: 
lay’ + Oae’ + Oars’ + Lay’ + Laos’ + 1a’ + 127 
| + Qa’ + Qry’ + Qayo’ + 2a’ + 2a’ = c, mod 5, 
Oxy! + lao’ + 2a’ + Lag’ + 225’ + 8x6’ + 4277 
+ Ox’ + 1a’ + 2a’ + 32’ + 4212’ = co mod 5. 


(6) 


(7) 


The values of cic2 corresponding to the condition that one and only 
one digit is too high by 1, x,’ = x; + 1, can be read by reading the coeffi- 
cients of the 7th digit in the corrector formulas. This quantity 1s therefore 
the characteristic of the zth digit. If x,’ = x; — 1, then the fives com- 
plements of these coefficients will be the value of the corrector. Table III 
lists the characteristics and characteristic complements associated with 
each digit. 


TABLE III — CHARACTERISTICS AND CHARACTERISTIC COMPLEMENTS 
SYSTEMATIC QUINARY CODE 


Digit Characteristic Complement of Characteristic 
X1 10 40 
Xe 01 04 
X3 02 03 
X4 11 44 
X5 12 43 
X6 13 42 
X7 14 4] 
X8g 20 30 
X9 21 34 
X10 22 30 
X11 23 32 
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In this code all the possible values of cic. correspond to the charac- 
teristic of a digit or the complement of this characteristic, except 00 
which corresponds to the correct message. (An inspection of equations 
(4) through (7) reveals that if x;,7 = 2; for all values of 7, the values 
of c, and c. are 0). Thus, we can assign a unique correction to each 
value of ci¢2 . 

The above techniques are extensible to other number bases and dif- 
ferent length words provided b, the number base of the channel, is 
greater than 2. (The equivalent binary channel problem has been treated 
by Hamming.’) The following set of rules and conventions may be 
used for deriving a satisfactory set of characteristics for a simple charac- 
teristic systematic code used to correct single small errors for any length 
message, and any base, b = 3. The rules must be followed, and the 
conventions (which represent one pair of conventions out of the set of 
pairs of conventions, which together with Rules 1 and 2 can be used for 
deriving a code of this class) if followed, will lead to a reasonably simple 
method for encoding and decoding messages.* Since the rules, not the 
conventions, limit the efficiency of the code, no set of conventions can 
be found which will lead to a more efficient code of this class. 

Rule 1. Yor an n digit message (including check digits), m check dig- 
its are required and m must satisfy the following inequalities: 


ie ee 
9 ? 
i) ae 


2 


IV 





if b is odd, (8a) 


IIV 


if b is even, n. (8b) 

Rule 2. No characteristic may be repeated; 1.e., each digit must have 
a characteristic different from that associated with any other digit. 

Convention 1. The various digits of a characteristic are arranged in a 
set order; 1.e., Ci;, Cor, -+- , Cmi. The first digit which is neither zero, 
nor (in case b is even) b/2, must be less than b/2. There must be at least 
one such digit. | 

Convention 2. The characteristic of the jth check digit has a 1 in the 
ath position and 0’s elsewhere. 

Rule 1 is required since, for a code of this type, we must be prepared to | 
correct any digit in one of two ways (+1). This implies a minimum of 
2n + 1 values of the corrector, one for each possible correction, and one 
for the case of no corrections. This means that b”, the number of possible 


* The above distinction between rules and conventions will be observed 
throughout this paper. 


NON-BINARY ERROR CORRECTION CODES Lodo 


values of the corrector, must be at least 2 n + 1, equation (8a). For 
even bases, we must reject all values of the corrector containing only 
the digits O and b/2 for representing error conditions for the following 
reasons: a positive error leads to a corrector that is the characteristic of 
the incorrectly received digit, and a negative error leads to the b-com- 
plement of such a characteristic. In order to have unique error correc- 
tion, we must be able to distinguish between these two conditions. If a 
characteristic were to contain only the digits 0 and b/2, it would be equal 
to its own b-complement; such combinations of digits are therefore not 
useable as characteristics or characteristic complements. 

Rule 2 is required to permit a unique identification of an incorrect 
digit in case of a single error. 

Convention 1 allows us to distinguish between positive and negative 
errors. By observing this convention, a characteristic (corresponding to a 
positive error) can be distinguished from its complement (corresponding 
to a negative error) by inspecting the first digit of a corrector which is 
neither 0 nor b/2. A characteristic will have this digit less than b/2, 
a characteristic complement will have this digit greater than b/2. If 
the corrector 1s a characteristic, the correction is minus one; if it 1s a 
characteristic complement, it is plus one. 

Once the characteristics have been chosen, the corresponding encoding 
procedure may be performed in the following manner: Let a;; represent 
the jth digit of the characteristic of information digit x; . Let z; represent 
the check digit which has a characteristic contaming a 1 in the jth 
position. If convention 2 has been observed, (9) can be used to cal- 
culate 2; : | 


2 Ayjet, = ej mod b. (9) 


An encoder calculates each z; and inserts it into the message in those 
digit positions which have the characteristic of the jth check digit as- 
signed to them. | 

In more general terms, we use implicit relations that are equivalent 
to the explicit equations given by (9). Letting z; represent an informa- 
tion or a check digit, and letting C;; represent the jth digit of the charac- 
teristic of the 7th information or check digit, these formulas may be re- 
written as 


2 C2; = 0 mod b. (10) 


At the receiver, the decoder calculates m different check sums. Let c; 
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represent the check sum corresponding to the jth corrector term, and 
x; represent the received value of x; : Then, 


>, Cyx,;’ = c; mod b. (11) 
f=1 


The difference between equations (10) and (11) is the result of any 
mutilations caused by the channel. If no error has occurred, all the c;’s 
are 0; if an error of +1 has occurred, the mc,’s will form the characteristic 
or the characteristic complement, respectively, of the incorrectly re- 
celved digit. 

One disadvantage of a systematic code is the discontinuity in the 
number of check states as a function of m, the number of check digits. 
For example, in decimal code one check digit 1s required for a message 
of up to four digits, and two check digits for up to forty-eight digits. 
Obviously, for a message of intermediate length, for example, twelve 
digits, many of the corrector states cannot be used for single error cor- 
rection since they will not correspond to any single error. A more effi- 
cient code would be obtained if the check states were limited to a smaller 
number. 

One method of reducing the number of check states is to perform the 
check in a different modulus than the modulus of the channel. In the 
single error detection code using a mixed digit, binary check informa- 
tion and quinary message information was conveyed by this digit. This 
code was more efficient than a systematic code because each message 
contained the minimum number of check states which js 2. 

If a mixed digit, x, is composed of the two components (y, z) where 
is the information state of the digit and z the check state, it 1s conven- 
ient to combine these two components to form x by means of the formula 


x= ay +z. (12) 


We calculate z by using a linear congruence equation modulo a. 

The use of this formula permits a decoder to act on x’, the received 
value of x, directly, without first resolving v’ into y’ and 2’, because (12) 
insures that x’ = y’ mod a. This permits x’ to be corrected directly and 
then resolved into its components. 

As an example, consider a semi-systematic code for correcting a single 
small error in a decimal system, using a twelve digit message; ten of the 
digits are information digits and two are mixed digits, each conveying 
binary message information and quinary check information. (One of 
these binary digits might represent the sign of the number.) 

With two quinary checks, twenty-five different check states are pos- 
sible; for correcting single small errors in a twelve digit message, twenty- 
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TABLE [TV — CHARACTERISTICS AND CHARACTERISTIC COMPLEMENTS, 
SEMI-SYSTEMATIC DECIMAL CODE 





Digit Characteristics Characteristic Complements 
x, (mixed digit) 1 O 4 0 
xo (mixed digit 0 1 0 4 
X3 0 2 0 3 
X4 1 1 4 4 
X5 1 2 4 8 
X¢ 1 3 4 2 
X7 1 4 4 |] 
Xg 2 0 3.0 
X9 2 1 3.44 
X10 2 2 3.3 
X11 2 3 3. 2 
X12 2 4 3.1 


five corrector states are required, one for each of the two possible cor- 
rections (+1) for each digit, and one for the case of a correctly received 
message. Characteristics may be chosen for the various digits in accord- 
ance with the rules and conventions outlined above in this case, since 
the check modulus is the same for both check digits. Consequently, it 
is no accident that these characteristics, shown in Table IV, are the 
same as those shown in Table III. 

Let Ca and Cj represent the characteristic of the 7th digit, and let 
y, and ye represent the two binary information digits. Then: 


12 

De Cir; = 2, mod Q, Ly => 2 + oY, (13) 
1=3 

12 

> Cyt; = —2, mod 5, Xo = 22 + Sye. (14) 
1=3 


Because 2 = 2, mod 5 and 22 = Zz. mod 5, these relations can be re- 
written implicity to resemble equation (10): 


12 


> Cuxv; = 0 mod 5, (15) 


» Cie; = 0 mod 5. (16) 


At the decoder, the corrector cic: 1s calculated by: 


12 
2 C2; = ¢,mod 5, 7 (17) 


12 
2d Cae = Co mod 5. (18) 
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If the corrector is 00, the message has been correctly received; other- 
wise, the corrector is either the characteristic or characteristic comple- 
ment of the incorrect digit, from which plus one or minus one respec- 
tively must be subtracted as a correction. | 

Consider the general case. Let 1, 22, --- , 4, represent the k informa- 
tion digits; yi, Ye, ‘°° , Ym represent the information state of the m 
mixed digits, and 21, 22, °°: , 2m represent the check state of the m 
mixed digits. In addition, let a, , a2, --- , am represent the number base 
Of 21, 22, ‘°° , 2m respectively; 81, Be, -:-, Bm represent the number 
of possible states of yi, Yo, °°* , Ym respectively, and 241, Ue42, °°°, 
Lk+m Tepresent the values of the mixed digits after the message has been 
encoded. (Note that for simplicity, a check digit is considered as a special 
case of a mixed digit; its information state is permanently 0.) The follow- 
ing encoding procedure may be used in which 2, 22, --- , % are used 
directly as part of the transmitted message. This is a semi-systematic 
code, which means that information digits are not changed in coding. 
To derive the mixed digits, the following formulas are used: 


Ay1X1 + See ALi; | = —21 mod 1 (19-1) 
Lksy = Yor + 21 (20-1) 
yet, + ees 1 Aart, 1 AeeqayVeEsy = —2, mod az (19-2) 
Tk4+2) = Y2a2 + 22 (20-2) 
BAdy Hy Se Se Ogee ee A pag AY 


= —z;moda;_ (19-}) 


Zee) = YsQg + 2; (20-3) 
ya ge 0 Py Am(k+m—1)U(k4+m—-1) = —&m mod Om (19-m) 
Cada = Unie a ee (20-m) 


In each case, the value of the check component z;, of a mixed digit 
X43) is determined by a formula involving the information digits and 
previously calculated mixed digits. Immediately after z; has been de- 
termined, q+, 1s calculated for possible use in calculating 2 j4n . 
After the message has been completely encoded the following equations, 
analogous to (10), will be satisfied. | 
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Let Ci; represent a;; in equation (19-}). Then, 
k+m 


De C523 = 0 mod Qj. (21) 
i=l 


(Since ta+4;) = 2; mod a;, substitution of v4, for z; In equation (19-]) 
will continue to satisfy the equation.) 
At the decoder, equation (21) is changed to 
k-+m 


oS Cire. = c; mod a; (22) 
i=l 


In (22), x,’ represents the received value of x; , and c; represents the jth 
digit of the corrector. If all the digits have been correctly received, 1.e., 
x; = x; for all values of 7, then c, = co = +--+ = Cm = 0; [see equation 
(21)]. If x, had been received incorrectly so that x,’ = 2, + 1, but all 
other digits had been correctly received, then the value of c; (the jth 
digit of the corrector) would be calculated in the following manner: 

k-+m 


Cj mod a; = Se C355 
i=1 


k-+m 

Cj mod ap = 2 C5524 + Ch; — Ch; (23) 

Equation (23) proves that C,; 1s actually the jth digit of the charac- 

teristic of x, , because by definition, the characteristic of x, 1s the value 

of the corrector when a,’ = x, + 1, and all other digits have been cor- 

rectly received. This means that the general term, C;; of (21), is actually 

the jth digit of the characteristic of the ith digit and that this is a simple 
characteristic code. 

For the case that x,’ = x, — 1, the value of the corrector is such that 
if it were incremented, digit by digit, by the characteristic of 2, , the 
corrector would be composed only of zeros. Incrementing the corrector 
by the characteristic of zx, is equivalent to recalculating the corrector 
with x,’ increased by one, which in this case would amount to calculat- 
ing the corrector for the case of a correctly received message. The 
latter is composed of all zeros [see (21)]. Thus, for the case of a single 
error of —1, the corrector is the characteristic complement of the digit 
which is incorrectly received. For a semi-systematic or systematic code, 
the characteristic complement is an m digit word whose jth digit is the 
complement modulo a; of the jth digit of the characteristic. 

Equation (20-]) shows that generally a;8; cannot exceed b. (An ex- 
ception is given below.) The maximum value of y; is 8; — 1 since y isa 


7" 


1358 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


digit in the number base 8; . The maximum value of z; is usually a; — 1, 
since z; 1s a digit in the number base a,;. Thus, 


Troi = Yay +2; Sb —1, (24) 
(Oe Dey sag el SOL, (25) 
ajB; = 0. (26) 


Equation (24) restates (19-}), and also states that the maximum value 
of any digit x, is b — 1, where b is the number base of the channel. In 
(25), the maximum values of y; and z; are substituted to yield the result 
shown in (26). 

It was stated above that the maximum value of z; is usually a; — 1. 
An exception occurs only in case z; checks only itself and other mixed 
digits, the latter being restricted to fewer than b — 1 states. Under such 
circumstances, the value of z is sometimes restricted, so that even though 
z is calculated to satisfy a check, modulo a; [see equation (19-})]|, it can- 
not assume a; — 1 values. For example, a code for transmitting a 
single digit message over a decimal channel and permitting the correc- 
tion of small errors, might use as the set of transmitted messages the 
digits, 0, 3, 6, 9. In this case, a = 3 (any correct message satisfies the 
check x = 0 mod 8) and 8 = 4 since four different messages may be 
transmitted. In this case, z is restricted to the value 0 because the mixed 
digit checks only itself. 

In order to correct single errors of +1, using a simple characteristic 
code, it is necessary and sufficient that every characteristic be different 
from every other characteristic, and that it also be different from the 
complement of every other characteristic. 

The following rules and conventions may be used to derive a set of 
characteristics which meet the requirements for a simple characteristic 
semi-systematic or systematic code for correcting small errors for any 
base b = 3 and an arbitrary length message. No set of conventions can 
be found which will lead to a more efficient code of this class, since the 
rules, not the conventions limit the efficiency of the code. 

Rule 1. For an n digit message, including mixed digits, containing m 
mixed or check digits of which m, are associated with an even modulus, 
a, the inequality 


(aya: coe *Am 271) /2 aa () (27) 


must be satisfied. 
Rule 2. No characteristic may be repeated, i.e., each digit must have 
a characteristic different from that associated with any other digit. 
Rule 3. Since the mth check is the last one to be calculated, and the 
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characteristic of the mth mixed digit must therefore contain only a single 
digit which is not 0, am must be greater than 2. 

Convention 1. The various digits of a characteristic are arranged in a 
set order, 1e., Ca, Cie, ++: , Cim. The first digit which is neither 0 nor 
a;/2 must be less than a;/2. There must be at least one such digit. 

Convention 2. The characteristic of the jth mixed digit has a 1 in the 
jth position and 0’s elsewhere, provided that a; # 2. If a; = 2, the char- 
acteristic of this mixed digit has a 1 in the jth and mth positions, and 0’s 
elsewhere. 

Rule 1 is required because the number of possible corrector states is 
Qy'Q2* ... *@m, Of which only those containing at least one digit which 
is neither 0 nor a/2 can be associated with the 2n possible errors. The 
same reasons used for Rule 1 for the systematic code case are equally 
applicable here; a characteristic containing only the digits 0 or a;/2 in 
the jth position is not distinguishable from its complement. 

Rule 2 is required to permit a unique identification of an incorrect 
digit. | 

Rule 3 is necessary to derive the sign of an error on the mth mixed digit. 

The reasons for using Conventions 1 and 2 in the case of the sys- 
tematic code are equally applicable in this case. For the case a = 2, 
however, a special convention must be used to avoid a conflict with 
Convention 1. 

The procedure for converting a set of characteristics into an error 
correcting code system is the same for a semi-systematic code as for a 
systematic code except that the following additional functions must be 
performed: the encoder must combine check states with information 
states to derive mixed digits, and the decoder must resolve mixed digits 
into information and check digits after it has performed its corrections. 

By using these rules and conventions, the most efficient simple charac- 
teristic code can be determined. For messages of length n (including 
mixed or check digits), the following relations must be satisfied: 

Let 


ve —= Gy" He" «1. "Am, 
Q _ B1- Be: coe "Bm , 


m = number of even a’s. 


Then: 


V 


(PS 2 ny (28) 
a iB; Ss b. (29)* 


* For exceptions, see above. 
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TABLE V — DEcIMAL ERROR CORRECTION CODES 


N yi oO 1,02, ... Big B55 25 — 2n+i1 
1 3 2.5 3 4 3* 
2 5) D 5 2 5 
3 10 10 10 1 7 
4 10 10 10 1 9 
5 15 16.7 5, 3 2, 3 11 
6 15 16.7 5, 3 23 13 
7 15 16.7 5, 3 2) 3 15 
8 20 20 10, 2 15 17 
9 25 25 5, 5 2) 2 19 
10 25 25 5, 9 2,2 21 
1] 25 25 5, 9 2,2 23 
12 25 25 55 2 2 25 
13 30 33.3 10, 3 13 27 
14 30 30.3 10, 3 1,3 29 
15 40 40 10, 2, 2 1.5.5 * 31 
16 40 40 10, 2) 2 15,5 33 
17 50 50 10,5 12 35 
18 50 50 10, 5 132 37 
19 50 50 10, 5 1,2 39 
20 50 50 10, 5 12 41 


* The single digit message containing the points 0, 3, 6, 9 is an exception to 
the inequality a8 S b, because the mixed digit checks only itself. 


Tor the most efficient code b”/Q should be minimized. This term repre- 
sents the ratio of the number of possible messages for an n digit message 
with and without error correction. This is normally at least as great as 
2n + 1, the number of possible corrections on such a message. 

Table V shows the most efficient decimal codes of this type for an n 
digit message, for values of n from 1 to 20. Where two or more different 
codes are equally efficient, the code with the fewest mixed digits is shown. 
It is easy to convert from a code using two mixed digits with a; = 5, 
a, = 2, to one using a check digit with a = 10, or to make the inverse 
conversion, and to show that both codes are equally efficient. 


IV. SINGLE ERROR CORRECTION CODES, UNRESTRICTED ERROR 


The problem of correcting an unrestricted error on one digit of a 
message must be divided into two categories, depending on whether b 
is a prime number or a composite number. As will be seen, the error 
correction problem for prime bases is considerably simpler than that for 
composite bases. The method for correcting errors in prime number 
systems was discovered by Golay,® although this did not come to the 
author’s attention until after he had worked out the same method. The 


NON-BINARY ERROR CORRECTION CODES 1361 


adaptation to non-prime channel bases is believed to be novel. Since the 
adaptation makes use of the code for prime bases, both will be described. 


4.1 Prime Number Base, Single Unrestricted Error Correction Code 


This code depends upon a fundamental property of prime numbers, 
well known in number theory.’ Let p represent a prime number and d, 
c, and w represent non-negative integers less than p, related by the 
expression: 


dw = cmod p. (30) 


If d ¥ 0, then d and c uniquely determine w. 

In order to have a simple characteristic systematic code for correcting 
unrestricted errors, 1t 1s necessary and sufficient that the set of charac- 
teristics shall have the property that all multiples of all characteristics 
are distinct. Equation (30) implies a unique correspondence between mul- 
tiples of a characteristic and the characteristic itself, if we consider c to 
be the multiple, d the multiplying factor and w a digit of the charac- 
teristic. An error, d, is simply identifiable if a known digit of a charac- 
teristic is always 1. If each characteristic is distinct from every other and 
if a sufficient number of check digits are available, a simple characteristic 
code can be obtained. In the following set of rules and conventions which 
may be used for deriving a set of characteristics for a simple charac- 
teristic systematic code for correcting single unrestricted errors, p repre- 
sents the prime number base of the channel. The number base of the 
channel must be prime, and the length of the message is arbitrary. Since 
the rules and not the conventions limit the efficiency of the code, no other 
set of conventions may be found which will lead to a more efficient code 
of this class. 

Rule 1. For an n digit message, m check digits are required and m 
must satisfy the inequality 


po —i (31) 
p—l1 

Rule 2. Each digit must have a different characteristic. 

Convention 1. The digits of a characteristic are arranged in a set 
order, 1.e., CaCi +--+ Cim. The first digit which is not 0 must be 1. 

Convention 2. The characteristic of the jth check digit has a 1 in the 
ath position and 0’s elsewhere. 

Rule 1 is required for a code for correcting single unrestricted errors 
since any digit must be correctable in one of p — 1 ways. This implies 
a minimum of n(p — 1) + 1 states for the corrector, one for each cor- 


IIA 
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rection and one for the correct message. When m check digits are used, 
p” corrector states are obtained. 

Rule 2 and Convention 2 are the same for the single small error cor- 
rection systematic codes. The same reasons apply for both cases. 

Convention 1 is changed from the equivalent convention for the small 
error correction code, because the magnitude of the error, not only its 
sign, must be derivable for a code for correcting single unrestricted 
errors. 

An encoder first encodes the message according to (82), where C;; 
represents the jth digit of the characteristic of x; , 


> C550; = 0 mod b. (32) 


The decoder calculates the corrector using the following formula where 
z;’ represents the received value of a ; 


>, Ci = c; mod b. (33) 


The decoder then examines the digits of the corrector in order. The 
first digit which is not 0 shows the magnitude, d, of the error. All digits 
are then divided by d (provided d # 0). (That division is unique, as 
shown by (30).) The result of this division is the characteristic of the 
incorrect digit, which is then corrected by subtracting d. 

Consider a code for correcting a single unrestricted error in a six digit 
message for a base 5 channel: 


5” — 1 


: 4 


IA 


(34) 


A value of 2 for m will satisfy equation (84). The characteristics are 14, 
13, 12, 11, 10 and 01, the last two being check digit characteristics, for 
41, U2, %3, 4, Xs, and 2 respectively. Here, x , %2, %3, and 2%, are in- 
formation digits. The encoding formulas are: 


ty + ve + v3 + vy = —25 mod 5, (35) 
Av, + 32. + 2x3 + x4 = —2— mod 5. (36) 


The decoding and correcting formulas are: (x,’ is the received value of 
Xi) 


ay’ + eo + 03" + ag! + a5’ 
Avy! + 820’ + 243’ + a4! + a5’ 


¢, mod 5, (37) 
c, mod 5. (38) 


The corrector 1s CC . 
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Suppose that a message 221321 is received as 224321. Then: 
c, = 13 = 3 mod 5, (39) 
Co = 26 = 1 mod 5. (40) 


To find the characteristic of the digit, x, , that was incorrectly received 
from the value of the corrector, (41) and (42) must be solved: 


d Chr = ( = 3 mod OQ, (41) 
d Che = Co = 1 mod a. (42) 


Because the first non-zero digit of any characteristic is 1, (41) can be 
solved for d since Cy, = 1. This yields the result, d = 3. Using this result, 
(42) is solved for Ciz ; by inspection, Che = 2, since 3-2 = 6 = 1 mod 5. 
Thus the characteristic of the incorrect digit, Cr Che, 1s 12, and the 
error d, is 3; 23’ must therefore be reduced by 3 to get the correct value. 
Since the message was received with 23’ too high by an amount 3, this 
result confirms our expected correction. 

Any correction that is applied must be applied on a modulo b basis. 
For example, if a correction of —2 is indicated on a digit whose re- 
ceived value is 1, 1 — 2 = 4 mod 5, which means that the digit is cor- 
rected to 4. 

Codes of this type are restricted in their construction. No mixed digits 
may be used, and the number base must be prime. For the case of 
n = [(p’ — 1)/(p — 1)] + 1, g + 1 check digits are required [see (31)]. 
This means that the number of information digits for a message of 
this length is the same as for a message one digit shorter, which requires 
only g check digits. A comparable binary case is the Hamming Code 
example of an eight binary digit message (four information digits) 
compared with a seven digit message (also four information digits). In 
the binary case, the extra digit 1s useful for double error detection, but 
unfortunately, this is not the case for non-binary codes. 


4.2 Composite Number Base, Single Unrestricted Error Correcting Code 


The problem of correcting an unrestricted error on a single digit, 
working with a number base b, that is not a prime is much more difficult. 
Many relatively inefficient techniques exist. For example, characteristics 
containing only binary numbers (0 and 1) might be used; (this would 
amount to using the Hamming Code directly). This 1s obviously inefh- 
cient since the corrector associated with any single digit error of amount 


1364 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


d, would contain only the digits 0 and d, thus wasting most of the pos- 
sible corrector values.* 

It is possible to encode and decode using the prime factors of the 
number base, performing separate and independent corrections on each 
factor. This is also inefficient, since for many cases, information as to 
which digit is in error is found independently in two or more ways, while 
for certain values of the error, it can be found in only one way. Working 
with mixed digits and check bases, a lower than b, is not satisfactory 
since certain values of the error (@ in particular) will never show up in a 
particular check. The technique used for primes will not work since 
multiples of two different characteristics may be identical; for example, 
base 10, characteristics 11 and 18, error 5, will both yield correctors of 55. 

Another technique that is relatively efficient is, however, available. 
It involves performing all check, encoding and decoding operations in a 
number base p, where p is some prime number (usually, the lowest) 
that is equal to or greater than b. (In case b is a prime, we use the pro- 
cedure outlined above, which is a special case of the procedure to be 
described below.) 

The obvious difficulty in such a procedure is that while the informa- 
tion channel can only handle b levels, the check digits may assume p 
levels, corresponding to the required p check states. This dilemma can 
be resolved by adding an adjustment digit. The object of this digit is to 
permit check information to be transmitted in a base greater than }, 
the channel base. The idea of an adjustment digit can best be illus- 
trated by an example. Supposefora decimal channel, checks are performed 
in a unodecimal (base 11) code. Let y represent the value corresponding to 
ten. (The consecutive integers in a unodecimal system are then 0, 1, 2, 
3, °°: ,9, y, 10, 11, --: , 19, ly, 20, etc.) Suppose in a particular mes- 
sage, four check digits, 21 , 22, 22 , 24, calculated modulo 11 from decimal 
information digits are used, whose values are 1, 0, y, 8. A fifth digit, 
Zo is added such that the sums modulo 11 of 2: + 20, 22 + 20, 23 + 2, 
za + 2 are kept constant at 1, 0, y, 8 respectively. There are eleven dif- 
ferent words satisfying the condition: [1, 0, y, 8] = [(é: + 20), (22 + 20), 
(zg + 2), (24 + 20)]. These are shown in Table VI. Of these words, six do 
not contain the digit y, and so may be transmitted over a decimal chan- 
nel. Thus, an adjustment digit permits check digits which are calculated 
in a number system of a higher base than 6, to be transmitted over a base 
b channel. When an adjustment digit is used in base p for adjusting m 
digits so that transmission over a channel in base 0 is possible, a mini- 


* A waste of corrector values is equivalent to an excessive number of check 
states for a message, which in turn implies an excessive number of check digits. 
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mum of b — m(p — b) states are allowed for the adjustment digit. (lor 
certain values of the check digits, more states could be allowed, but a 
code for utilizing these extra states becomes unwieldy.) For the case 
b = 10, p = 11, this turns out to be 10 — m. At least one state must 
be available for each adjustment digit, to have a workable code. 

The characteristic of an adjustment digit is determined in the follow- 
ing way: if an adjustment digit adjusts the jth check digit, then the jth 
digit of the characteristic of the adjustment digit 1s 1; otherwise, it is 0. 
The characteristic of all other digits may be derived using the rules de- 
scribed above for the prime number base channel, except that p, the 
prime number base of the code must be used instead of 6, the number 


TABLE VI — ILLUSTRATION OF ADJUSTMENT DIGIT 


20 21 22 23 e4 
0 1 0 ¥y 8 
] 0 y 9 7 
2 y 9 8 6 
3 9 8 7 5 
4 8 7 6 4 
5 7 6 5 3 
6 6 5 4 2 
7 5 4 3 1 
8 4 3 2 0 
9 3 2 1 y 
y 2 1 0 9 


base of the channel, for generating characteristics. A message 1s initially 
encoded using a value of 0 for an adjustment digit. Subsequently, if the 
adjustment digit always has at least g allowable states, it may be used 
to transmit one additional information digit, base g, of information. If 
the value of this information digit is y, the (y + 1)st lowest possible value 
of the adjustment digit (making the lowest value equivalent to y = Q) 
meeting the requirement that all adjusted check digits are no greater 
than b — 1 is transmitted. The adjustment digit in conjunction with 
its associated check digits conveys a digit, base q, of information. 

In the example given above, q = 6 and if y 1s 4, the fifth lowest value 
of z , 7, is transmitted. The lowest value must be associated with y = 0. 
The values of 2021222324 that are sent over the decimal channel are 75431. 

An example of such a code is one using a decimal channel working in a 
unodecimal base for the purposes of encoding and error correction. The 
word length, n, is twelve, nine decimal information digits, one octal (base 
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8) information digit associated with the adjustment digit, and two check 
digits. The characteristics are the following: 


a1 ly v5 16 Xy 12 
x2 19 ae 15 a9 11 (adjustment digit) 
x3 18 v7 14 tu 10 (check digit) 
v4 17 xg 13 v2 O1 (check digit) 
Let zu and 212 represent the values of the check digits a and wp, 
originally derived from 21, %2, °°* , Ug, %: 
ty + ao tay + +++ + x = —2y mod 11, (43) 
yt, + Ove + 8x3 + +--+ + 2x = —2. mod 11. (44) 


From zy and zi. , the ten different words (0, 21 , 212), (1, 2u — 1, 22 — 1), 
(2, 2 — 2, ae — 2), --- , (9, en — 9, 212 — 9) are formed. If y is the 
value of the octal information digit, the (y -+ 1)st such word, that does 
not contain the digit y, is selected and transmitted as the last three 
digits of the message. For example, if 2. = 2, 212 = 1 and y = 6, the ten 
words are (0, 2, 1), Ge ie 0), (2, 0, Ys (3, Y) 9), (4, 9, 8), (5, 8, 7), (6, 4, 6), 
(7, 6, 5), (8, 5, 4), (9, 4, 3); the word (8, 5, 4) is selected since it is the 
seventh in the sequence that does not contain any y’s. Table VII shows 
the choice of the three last digits as a function of y, given 21, = 2, 212 = 1. 

Formula (45) 1s used for calculating the corrector. Let C;; represent 
the jth digit of the characteristic of 2;, c; the jth digit of the cor- 
rector, and x,’ the received value of x; . Then, 


12 


c; = >, C,,-' mod 11. (45) 
i=l 
The translation from corrector to correction is the same as if the original 


TABLE VII — RELATION BETWEEN ApbDJUSrED DIGIT AND 
ASSOCIATED INFORMATION 


<2 
R 
S 
& 
= 
= 
wo 


“IT S> Or He CO DD = © 
MOMATIMD OIL © 
H= Or Go sT OO Cor bh 
WR a1o Oe 
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message had been in a unodecimal code. (This has been illustrated in 
Section 4.1.) | 

The first step of the encoding procedure is to calculate the unadjusted 
check digits. Next, the adjusted check digits and adjustment digit are 
selected according to the value of y, the information digit associated with 
the adjustment. The message is then ready for transmission. 

At the decoder, the message is first corrected as if it had been re- 
celved as a unodecimal message. The information digits are then in 
their corrected states. Next, the adjustment digit and the check digits 
are examined and the inverse of the encoding process used to select a 
particular set of check and adjustment digits is used to reconstruct the 
value of y which originally controlled the selection. In the example given 
above, the values of a1 , Yu , V2 are 8, 5, 4 respectively; the decoder recog- 
nizes that this is the seventh lowest value of x1) , which means that the 
value of y, used in selecting 21) and the adjusted values of xy and 212, 
was 6. 

The code described above is fairly efficient; about 90 per cent of the 
corrector values can be associated with corrections; the product of the 
information states and the check states is about 97 per cent of the 
total number of states of a twelve decimal digit word. Each of the above 
factors reduces the efficiency of the code below a possibly unattainable 
maximum. It will be noted, however, that this reduction is relatively 
small in both cases, and is very much lower than would be the case for 
any of the rejected schemes. The scheme is not difficult to instrument; 
relatively little additional equipment is required in addition to the 
basic equipment for mstrumenting a simple prime number base chan- 
nel, unrestricted single error correcting code system. . 

The method of adjustment digits is general and can be used for de- 
riving a single error correction code for correcting unrestricted errors 
for any channel base. Any convenient prime check base, p, at least as 
great as b may be used, although the lowest will generally be the most 
efficient. The only requirements which must be fulfilled are that the 
number of states of the adjustment digit must be at least 1, and that at 
least two check digits must be associated with each adjustment digit. 
An adjustment digit associated with m check digits, working with a 
channel baseb anda check base p, may have b — m(p — b) different states. 


V. SINGLE ERROR CORRECTION, DOUBLE ERROR DETECTION CODES FOR 
CORRECTING SMALL ERRORS 


Single error correction, double error detection codes are very useful 
in situations where a message may occasionally be repeated. In order 
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for a correction code to be reasonably useful in a system with random 
noise or errors, the errors must be relatively infrequent, which makes 
double errors still more infrequent. If means are available for an occa- 
sional but very infrequent repetition of a message, a single error correc- 
tion, double error detection code will increase the reliability of a digital 
system, since a message may be repeated if a double error is recognized. 

This section will show how the ideas of the single error correction, 
double error detection Hamming Code may be combined with the ideas 
of semi-systematic single small error correction codes (described in Sec- 
tion IIT) to derive simple and efficient codes for correcting single small 
errors and detecting double small errors. | 

In order to derive a simple characteristic code for correcting single 
small errors, and detecting double small errors, a set of characteristics 
must be found having the property that the sum or difference of two 
characteristics or their complements or double the value of one charac- 
teristic or 1ts complement be distinguishable from the value of any 
single characteristic or its complement. The sum of two characteristics 
represents the value of the corrector for a message with two errors of 
+1, +1, the difference represents two errors of +1, —1, the sum of 
their complements represents two errors of —1, —1; double a charac- 
teristic represents an error of +2, and double a complement represents 
an error of —2. To have a true single error correction, double error de- 
tection code for small errors, all these cases must be distinguished from 
the case of a single error or no error by making certain that the value of 
the corrector for any of these cases is different than the value of the 
corrector corresponding to any single error and no error. 

Table VIII gives the characteristics used in the single error correction 
Hamming Code and the single error correction, double error detection 
Hamming Code for conveying four digits of information in a message 
containing seven or eight binary digits respectively. 

An inspection ot Table VIII shows that the sum (performed without 
- earries from column to column) of any two characteristics in the right 
part of the table is distinguished by having at least one 1 in the first 
three places and a 0 in the last place. This distinguishes it from any 
single characteristic since all characteristics have a 1 in their last place. 

Some difficulties arise in trying to adopt such a scheme directly in a 
non-binary system. For the code to be efficient, an over-all check would 
have to be performed using a mixed digit; only two check states are 
required for an over-all parity check, and if b > 3, (b representing the 
number base of the channel) at least two information states are pos- 
sible. But the over-all check digit, which performs a binary check, is not 
checked by any other digit. This means that although errors might be 
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detected in an over-all check digit, difficulties would be encountered in 
determining the direction of the correction, so that the information 
conveyed by the mixed digit could be used. Actually, means are avail- 
able, for accomplishing an adaptation of binary techniques. These meth- 
ods are described in Section VII but they are less straightforward than 
the ones described below. 

For channels with base b, greater than 3, at least one check may be 
made using a check base, a» , that 1s 4 or greater. If characteristics are 
used whose last digit (the digit associated with the a» check) is always 1, 
and whose only other limitation is that each characteristic is different 
from every other characteristic, a satisfactory code is obtained. Single 
errors are corrected in the normal way. If the last digit of the corrector 
is 1 or am — 1, the error is +1 respectively on the digit whose charac- 


TABLE VIII — CHARACTERISTICS FOR HAMMING CODES 


Single Error 


Single Error Correction Double 


Correction Error Detection 
001 Check Digit X1 0011 
010 Check Digit Xo 0101 
011 Information Digit X3 0111 
100 Check Digit X4 1001 
101 Information Digit X5 1011 
110 Information Digit X6 1101 
111 Information Digit X7 1111 


Over-all Check Digit X5 0001 


teristic or whose characteristic complement is indicated by the cor- 
rector. If the last digit of the corrector is 2 or a» — 2, or the last digit is 0 
and other digits are not all 0, a double error is indicated. If the entire 
corrector is made up of 0’s, the message is correct as received. 

An example is a code for a ten digit message, decimal base channel; 
eight decimal information digits, one mixed digit conveying binary 
message information (such as the sign of the decimal number) and qua- 
ternary (base 4) check information, and one check digit are transmitted 
in each message. Let 2, and x2 represent the mixed and check digit re- 
spectively, x; through x1 the information digits, y, the binary informa- 
tion conveyed by 2; , and 2; the quaternary check information conveyed 
by 2, . The encoding formulas are: 


2x3 + 32, + 4x5 + 525 + 6x7 + 72x + 8x9 + 9x10 = —X2mod 10, (46) 
to + a3 + tat Us + Xe + Ar + te +X + 4% = —%mod4, (47) 
t= 2+ 4y. (48) 


I 
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Note that (46), (47) and (48) must be applied consecutively, in that 
order, since (47) cannot be applied without knowing x2 obtained oH 
(46), aid (48) requires 2; , obtained from (47). 

The characteristics are Ol, 11, 21, 31, 41, 51, 61, 71, 81, 91 respec- 
tively; the complements of the penne abe are 03, 93, 33, 73, 63, 53, 
43, 33, 23, 13 respectively. The corrector, cice, 1s calculated at the 
decoder by the following formulas (2, is the received value of 2;): 


10 
cq = >) (xe) —1)mod10 (49) 
i=? 
10 
» = > x,’ mod 4 (50) 
1=1 


Consider the example of a message with decimal information digits 
3 75 2 0 6 5 2 and binary information digit 1. Then xv. = 3, 24 = 3, 
and y,; = 1, yielding a value of 7 for 2, . The message is sent as 7 3 3 7 
5 2 0 6 5 2. Suppose that the sixth digit is changed to 1 in transmis- 
sion. Then the corrector has a value 53; this is the complement of the 
characteristic of the sixth digit and indicates that the sixth digit should 
be incremented by 1 according to the rules previously stated. If the 
sixth digit had been received as 1 and the seventh digit also received as 1 
(an error of +1), then the corrector value would be 10, indicating a 
double error (see rules stated above). 

If a multiple of 4 is used as a, , the last digit of a characteristic may 
assume all odd values below a,,/2. The rule then is that an even value of 
the last digit of the corrector, or a O for the last digit and other digits 
of the corrector not all 0, indicates a double error. 

The following set of rules and conventions may be used with any 
base b 2 4, and any length of message, for deriving a set of charac- 
teristics for a semi-systematic code for correcting single small errors and 
detecting double small errors. Since the conventions restrict the eff- 
ciency of the code, it is conceivable that a different set of conventions 
will yield a more efficient code in some cases; (51) may be modified 
through the use of an alternate set of conventions. 

Rule 1. No two digits may have identical characteristics. 

Convention 1. Choose for a» a multiple of 4. Let an/4 = g. 

Convention 2. The characteristic of the mixed digit associated with 
Qm contains a single 1 in the last position; the rest of its digits are 0. 

Convention 8. The characteristics of the 7th mixed or check digit con- 
tains a 1 in the last position, a 1 in the 7th position and 0’s elsewhere. 

Convention 4. The characteristic of an information digit has an odd 


- NON-BINARY ERROR CORRECTION CODES 1371 


number less than a,,/2 in its last position. The rest of its digits are 
arbitrary. 

Convention 5. The above conventions restrict the choice of charac- 
teristics. In order to have n distinct characteristics, m mixed or check 
digits, using check bases a1, a2, -** , @m, are required, and inequality 
(51) must be satisfied: 


N S ayaa... ‘Am—-1°g- (51) 


Codes may be derived using the above conventions only if b 2 4. 
For the ternary case, a relatively efficient code may be obtained by 
using one ternary digit as an over-all parity check digit. The rest of the 
message is in a single small error correction code, derived using the 
rules and conventions of Section III. Any single small error will lead to a 
failure of the parity check, and a double small error will lead to a failure 
of other checks but not the parity check. 

No general solution has been found for deriving an efficient single 
error correction double error detection code for the unrestricted error 
case. Also, no general solution has been found for deriving an efficient 
multiple error correction code for the unrestricted error case. A reason- 
ably efficient method has been found for correcting multiple errors in 
the more important small error case; this is discussed in Section 6.2. 


VI. THE USE OF BINARY ERROR CORRECTION TECHNIQUES IN NON-BINARY 
SYSTEMS 


In this section, methods for using binary codes for the correction of 
errors in a non-binary system are described. Although the single small 
error correction codes obtained in this manner are generally less flexible 
than the codes obtained in Section ITI, the class of multiple error correc- 
tion codes described in Section 6.2 is the only reasonably satisfactory 
class of such codes that has been found. The codes described in this 
section are semi-systematic but are not simple characteristic codes. 


6.1 Single Small Error Correction Codes 


Binary codes are most conveniently used for correcting small errors 
(+1). Suppose any digit, base b, has an associated pair of binary digits, 
arranged in such a way that a change of +41 in the base 6 digit will 
change only one of the two binary digits. For 6 = 10, an association 
such as the one shown in Table IX might be used. For example, if a 
6 is received as a 7, the associated binary message would indicate 
that the second of the binary digits is incorrect; a 7 can be corrected 
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TABLE LX — AssocriaTgep Binary DiciIts FoR CORRECTION 
OF SMALL ERRORS 


Decimal Digit Associated Binary Digits 


WOON OORPWNre © 
i) 
— 


TABLE X — REFLECTED QUIBINARY CODE 


Decimal Digit Quinary Component Binary Component _ [Associated Binary Digits 


0 0 0 00 
1 0 1 Ol 
2 1 1 11 
3 ] 0 10 
4 2 0 00 
4) 2 1 01 
6 3 1 11 
7 3 0 10 
8 4 0 00 
9 4 1 01 


to an 8 or a 6, but only the correction to 6 would correspond to a 
change in the second binary digit of the associated binary message. 

Ii the first of the associated binary digits is the odd or even indication 
of a quinary component of a decimal digit, a decimal digit can convey 
ten states rather than the four states of the associated binary digits. 
The combination of binary and quinary digits shown in Table X may 
be called a reflected quibinary code because of its analogy with the re- 
flected binary code.* 

If a method were available for transmitting without error (e.g., by 
using an error correcting code) a message composed of the associated 
binary digits in a base b code, small errors could be corrected in the 
base 6 digits. 

An examination of Table X for resolving a decimal digit into binary 
and quinary components, reveals that a change of +1 on any decimal 


* The refiected binary code has the property that each increment changes only 
one binary digit; for example, the eight successive words of a three mae digit 
reflected binary code are 000, 001, 011, 010, 110, 111, 101, 100. 
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digit will change only one of these two components. Further, an error 
corresponding to a change in the quinary component can be uniquely 
corrected if the error in the decimal digit is assumed to be +1. Tor 
example, if a received 6 is discovered to have an incorrect quinary com- 
ponent, only a decrease in the quinary component making the decimal 
digit 5 1s a possible correction, since an increase in the quinary com- 
ponent would correspond to the decimal digit 9, a change of more than 
+1 from 6. 

A system is shown in Fig. 2 for taking advantage of these properties. 
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Fig. 2 — Use of binary codes with a decimal channel. 
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In this example, an information source generates n quinary and n — m 
binary information digits for each message. All quinary digits go through 
an odd or even recognition circuit to be converted into binary digits for 
the purpose of generating a binary error correction code message. These 
binary digits and the binary digits generated by the information source 
are fed into a systematic binary error correction code encoder whose 
output 1s a binary message containing 2n digits, of which m are parity 
check digits. This output is divided into two parts, 2n — m original 
inputs to the encoder unchanged by the encoding process (this is a sys- 
tematic encoder which does not change information digits in encoding), 
and m parity check digits. | 

The m parity check digits are then combined with m of the quinary 
information digits through the use of the reflected quibinary combiner 
to form m of the decimal digits of the decimal message that is trans- 
mitted; the other decimal digits are formed by combining the n — m 
binary information digits with the rest of the quinary information 
digits. 

The decimal message is transmitted over the noisy channel and arrives 
with one or more (a number limited by the choice of the binary code) 
errors of -+-1 on decimal digits. It is fed into a reflected quibinary resolver 
which resolves decimal digits into binary and quinary components in 
accordance with the reflected quibinary code (Table X). The quinary 
digits are then fed into an odd or even recognition circuit to form binary 
digits; these and the binary outputs of the resolver are fed into a binary 
decoder and corrector, working with the same code as the binary en- 
coder. The output of this corrector should correspond to the output of 
the original binary encoder. 

In the decoder, the binary digits are corrected. When the binary digit 
derived from a quinary digit is corrected, however, the quinary digit is 
not yet correct. The correction of the quinary digit is performed by 
examining both the corrected binary digit derived from the quinary 
digit and the corrected binary digit which was derived from the same 
decimal digit as the quinary digit in question. The rules for correcting 
the quinary digit are given in Table XI. 

As an example, consider the application of a Hamming Code for 
transmitting ten binary digits in a fourteen binary digit message. 

Using a code of this type, single errors of +1 may be corrected in a 
seven digit decimal message, transmitting seven quinary digits of in- 
formation and three binary digits of information. The characteristics 
required for a fourteen binary digit Hamming Code message are shown 
in the first column of Table XII, 
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TABLE XI — CorRECTING QUINARY DIGITS 


Correction of 


Q Bi Be Quinary Digit 
Iiven 0 0 None 
Even 0 1 None 
Even 1 0 —] 
Even 1 1 +] 
Odd 0 0 +1 
Odd 0 1 —] 
Odd ] 0 None 
Odd 1 J None 


TABLE XII — BINARY CODE USED FOR CORRECTING 
DECIMAL MESSAGE 


Binary b 


Eliieectacier cs Position in Decimal Message 


0 0 0 11} Parity Check Digt (0) (0) | Binary comp. of Ist digit 
0 0 1 O} Parity Cheek Digit (0) (O) | Binary comp. of 2nd digit 
001 1 1 1 Binary comp. of 8rd digit 
0 1 0 0] Parity Cheek Digit (1) (1) |} Binary comp. of 4th digit 
0101 0 0 Binary comp. of 5th digit 
0 11 0 0 0 Binary comp. of 6th digit 
0111 1 3 Quinary comp. of 7th digit 
1 0 O O} Parity Check Digit (1) (1) | Binary comp. of 7th digit 
1 00 1 1 3 Quinary comp. of 6th digit 
1 01 0 0 2 Quinary comp. of 5th digit 
1011 0 4 Quinary comp. of 4th digit 
1 1 0 0 1 1 Quinary comp. of 3rd digit 
1101 1 3 Quinary comp. of 2nd digit 
1 11 0 0 0 Quinary comp. of Ist digit 


To illustrate the method completely, a strictly binary example will 
first be illustrated, then a related decimal example. In column a of Table 
XII, the digits of a binary message are indicated and in column J, the 
binary and quinary information digits. The values of the parity check 
digits, which are shown in parentheses, are calculated by the usual for- 
mula. Let C;; represent the jth digit of the characteristic of the zth 
digit (including parity check digits): 

14 

>> 2:03 = 0 mod 2. (52) 

t=] 
This formula applies for all values of 7 and in this case will yield four 
implicit equations each with one unknown term, the value of the parity 
check digit. Using the given values of the binary information digits, 
the values of the parity check digits are calculated. These are shown in 
parentheses in Table XII. 
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The binary message is 
0011001110011 0. 


Tor this example, the quinary components (quinary information digits) 
of decimal digits are chosen odd if the corresponding digit of the binary 
example is 1, even if that digit is 0. The binary and quinary components 
are then combined by the rules of the reflected quibinary code to form 
the decimal digits 0 7 2 9 4 7 6. For example, the quinary and binary 
components of the fifth digit are 2 and 0, respectively; the decimal digit 
which has these components is 4, the fifth decimal digit of the message. 

Consider the binary case. Suppose that the message is mutilated in 
transmission so that the tenth digit is received incorrectly. The message 
is mutilated from 


O0110011100110 
to 
0011001111011 0. 


The decoder and corrector calculates the corrector by 
14 
c; = >) aiCi; mod 2. (53) 
w=1 


In this formula, c; is the jth digit of the corrector and x,’ the received 
value of x; . In this example the corrector is 1 0 1 0, which means that 
the tenth digit, which has this characteristic, is wrong and should be 
changed to 0. 

The corresponding error in the decimal example is a change in the 
fifth digit from 4 to 3. If the message 0 7 2 9 3 7 6 is received, the 
resolver and quinary to binary converter delivers the message 


00110011110110 
to the decoder instead of 
001100111200110 


corresponding to the correct message. The corrected binary message is 
produced at the output of the decoder and corrector. When the quinary 
and binary components of the fifth digit are examined by the quinary 
correction circuit, the following inputs exist: 


Received quinary digit 1 (Odd) (quinary component of 
| received decimal 3) 
Corrected binary digit 

derived from quinary 0 (B,) 
Corrected binary digit 

from same decimal number _—O (B). 
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Table XI shows that the quinary digit must be increased by 1 to 2, 
which combined with the binary 0 conveyed by the same decimal digit 
yields a decimal value of 4, the original transmitted value. 

The best semi-systematic simple characteristic code for coreecine 
single small errors in a seven digit message allows 6 X 10° possible mes- 
sages In a seven digit message (see Table V), whereas this code allows 
6.25 X 10°. This code is therefore slightly more efficient. In addition, 
this code has the special advantage that any error of +2 on one digit 
is recognizable since the corrector will have a value of 1111 for the asso- 
ciated binary message. (An inspection of the choice of characteristics 
and assignment of characteristics to the two components of any decimal 
digit will confirm this.) 

This general technique can be applied to any base b channel, provided 


TaBLE XIII — Components oF QuinaRy DiaITs 


Mixed Digit Information Digit 


Quinary Digit Info. Comp. Check Comp. Quinary Digit Binary Comp. | Ternary Comp. 
0 0 0 0 0 0 
1 0 1 1 I 0 
2 1 I 2 I 1 
3 1 3 0 I 
4 not used not used 4 0 2 


* Tf quinary information is initially generated, the combination (1, 2) will not 
occur. 


that b is greater than 3. For odd bases, the digits which convey a parity 
check component and an information component cannot be utilized effi- 
ciently since one state of the base b digit is not available. For example, 
using a base 5, (see Table XIII), only two information and two parity 
check states may be conveyed by one digit, since the use of a third infor- 
mation state would require at least six states for the mixed digit. In 
the case of information digits, however, all states can be used. In the 
quinary example, the resolution of a digit into two components and the 
subsequent recombination is subject to the restraint that one of the com- 
binations (1, 2) will not occur, which can be assured if the information 
source generates quinary digits. 

For the case of high redundancy codes having the property that the 
associated binary code contains more than 50 per cent parity check 
digits (corresponding to a negative value of n — m in Fig. 2), at least 
some of the base 6 digits must convey two or more parity check digits. 

This can be easily accomplished: a decimal digit can convey three 
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TABLE XIV — Decimau Diagir CoNveYING THREE Binary Dicits 


Decimal Digit Binary Components 
0 0 0 0 
1 0 0 1 
2 Oho. vf 
; 3 010 
4 1 1 0 
5 1 1 I 
6 1 O 1 
7 1 0 0 
8 not used 
9 not used 


parity check digits if a simple reflected binary code correspondence be- 
tween binary and decimal digits is maintained as shown in Table XIV. 

An extension of this idea is the encoding of the original information 
(i.e., the information that 1s shown coming out of the information source 
in Fig. 2) in some error detection or correction code. For example, the 
decimal to reflected quibinary code resolver will cause both components 
to be incorrect if an error of +2 in a decimal digit occurs. In this case, 
the system shown in Fig. 2 will automatically make a correction on the 
decimal digit of either +2 or —2 depending upon the value of the re- 
ceived decimal digit, and provided a double error correction binary 
code is used. Such a correction will be incorrect about half the time. If 
the received binary digit 1s compared to the corrected binary digit and 
the received quinary digit 1s compared to its corrected odd or even digit, 
an error of +2 can be detected without changing the code. If one extra 
binary check digit, treated as an information digit by the encoder and 
decoder, is transmitted in the message, this binary digit can convey the 
information necessary for determining the sign for a correction of +2, 
provided that only one such correction is required for any one message. 
A rule for determining the value of this digit is: 


B. = 0 if > @ = (0 or 1) mod 4, 
i=l 
(54) 


ae | if 2, 4 


w=1 


(2 or 3) mod 4, 


where q; represents the 7th quinary information digit, and B, represents 
the special check digit. If the received message contains one error of +2 
on a digit, two possible corrections may be made on the quinary compo- 
nent of this digit; +1. Obviously, only one of these corrections will 
satisfy the equation for determining B, since the two possible corrected 
values of g are two units apart. 
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Note that the associated binary codes for performing such a correc- 
tion must have the property that two binary digits may be corrected 
since an error of +2 corresponds to incorrect values for two associated 
binary digits. If the noise is such that errors of +2 are not very unlikely, 
it may be desirable to place the binary and the quinary components of 
any one decimal digit in a different binary error correction code word so as 
to make the errors independent. In a seven decimal digit message, as an 
example, the quinary components of the first four decimal digits can be 
used to generate parity check digits which are conveyed by the binary 
components of the last three decimal digits. The binary component 
of the fourth decimal digit (this might be B.) and the quinary com- 
ponents of the last three decimal digits generate parity check digits 
conveyed by the binary components of the first three decimal digits. 
Two separate binary error correction code messages are then conveyed 
by a single seven digit decimal code message. Each message is in a four 
information digit, three parity check digit Hamming Code. Through the 
use of this code, one error in the binary component of any decimal digit, 
and one error in the quinary component of any decimal digit may be 
corrected. | 

In certain cases, the quinary digits themselves might be encoded in an 
error correction code for single unrestricted errors before the binary 
process is carried out. This is helpful chiefly for occasional large errors, 
leading to initial miscorrections. | 

The variations based upon the principles described, which can be 
applied to any channel, provided b = 4, including the pyramiding of one 
code scheme upon another, are almost endless. Generally, the last 
encoding and first decoding step should be able to correct many more 
errors than the first encoding step. For example, 1f quinary components 
are encoded in single unrestricted error correction quinary code, the bi- 
nary code should probably be a triple or quadruple error correction code; 
otherwise a correction may not correspond to the most probable error 
condition, and the correction scheme loses its effectiveness. 

These techniques cannot be conveniently applied to the ternary chan- 
nel, since a ternary digit cannot be resolved into two components effi- 
ciently. 


6.2 Multiple Small Error Correction Codes 


One limitation of the above techniques is the requirement for a sys- 
tematic binary code; i.e., a code in which some of the binary information 
digits are transmitted directly, and others are determined by parity 
checks on information and previously calculated check digits. These 
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TABLE XV — RErED-MULLER CopEs — 256 Dicir MESSAGE 


Number of Digits of Information per Message Number of Errors Correctable per Message 

255 0 

247 ] 

219 3 

(163 é 

93 15 

37 3l 

9 63 

1 127 


systematic codes are conveniently applicable only to the correction of 
single errors and a few special cases of multiple errors. 

The Reed-Muller” codes are not systematic codes, (“systematic” 
being used in the narrow sense indicated above, not in the sense of Ham- 
ming’), but offer the advantage that multiple error correction is rela- 
tively straightforward. For this reason, it is desirable to find some way 
of adapting the binary Reed-Muller codes for correcting a number of 
small errors in non-binary codes. 

To explain the nature of the Reed-Muller codes completely is beyond 
the scope of this paper; a list of their important features is sufficient. 
This is: 

1. The length of a message is 2” binary digits for the simpler versions 
of the code. 

2. If C.’ represents the number of combinations of d items taken 
cata time, and C,’ = d!/[c\(d — c)!], then 2° — >°7 5 Cg_; information 
digits may be transmitted correctly in a message containing 2” digits, 
if no more than 2” — 1 errors occur in the messages; 2” errors are de- 
tected but they are not always correctable. The Reed-Muller codes for 
correcting a large number of errors will frequently correct more than 
2” — J errors, and will always correct 2" — 1 or fewer errors. 

These values are given for a 256 digit message in Table XV. 

3. Each digit of the transmitted message is a parity check of a group 
of digits from the information source; the message cannot be broken down 
into information digits and check digits. 

4. The decoding is accomplished by a number of majority decisions 
among different groups of message digits. | 

A technique will be described for using a Reed-Muller code efficiently 
to correct a number of small (+1) errors for any code base b that is a 
multiple of 2, and also, at a small sacrifice of efficiency, a number of larger 
errors. 

A theorem, stating that any code which is generated by a set of parity 
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checks will contain the same set of allowable messages as some systematic 
code, was proved by Hamming.” In particular, such a theorem indicates 
that a Reed-Muller code will contain the same set of allowable messages 
as some systematic code. This was also proved by Slepian,” who has given 
a simple method of deriving a systematic code generating the same set 
of messages as a Reed-Muller code. For convenience, such a code will be 
called an SERM code (Systematic Equivalent Reed-Muller code). 

A Reed-Muller decoder serves to derive the information digits from a 
message in Reed-Muller code which may have been mutilated by noise. 
Ii a Reed-Muller decoder is followed by a Reed-Muller encoder, the com- 
bination serves as a noise eliminator (provided the noise is within the 
correction bounds of the code), since the output of the encoder is the 
noiseless Reed-Muller code message that is equivalent to the noisy 
message that entered the decoder. This property is useful since it means 
that any message, drawn from the set of Reed-Muller code messages, 
which has not been mutilated outside the bounds set up by a particular 
Reed-Muller code, will be restored to its original form, by a Reed-Muller 
decoder followed by a Reed-Muller encoder. Since an SERM code will 
produce only messages included in the set of messages of the correspond- 
ing Reed-Muller code, the SERM code can be used in conjunction with a 
Reed-Muller decoder and encoder to permit transmission over a noisy 
channel in a systematic code. 

The two systems shown in Fig. 3 are therefore equivalent in their 
error correction properties. In both cases, messages from the set of Reed- 
Muller code messages are sent, and since the same decoder is used int- 
tially, both systems will correct errors in the received message in the 
same manner. The Reed-Muller encoder in the second system is re- 
quired because a Reed-Muller decoder does not correct a message but 
derives information digits from the received message directly. The 
derived information digits, however, necessarily correspond to some 
corrected form of thereceived message and, in effect, the decoder performs 
the same correction as it would perform by deriving the corrected form of 
the message first. 


INFORMATION] REED MULLER NOISY _| REED MULLER [INFORMATION 
To pee ENCODER CHANNEL DECODER —> 
INFORMATION; SERM NOISY RM RM DIGIT INFORMATION 
ae ENCODER CHANNEL DECODER ENCODER SELECTOR ae 


Fig. 3 — Equivalent systems using SERM and Reed-Muller codes. 
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TABLE XVI — MULTIPLE SMALL ERROR CORRECTION CODE 
Usinec SERM Coprs witwH DEcIMAL CHANNEL 


Information Digits _ Check Digits No. of Small E 
Message Length cas aa Correctable. sepa 
128 127.7 3 0 
128 125.3 Pe | ] 
128 116.9 11.1 3 
128 100.0 28.0 7 


This means that a Reed-Muller code can be adapted to the system 
shown in Fig. 2. The Systematic Binary Error Correction Code Encoder 
is simply an SERM encoder; this is permissible since the SERM codes 
are systematic. The Binary Decoder and Corrector is simply a Reed- 
Muller decoder followed by a Reed-Muller encoder. Everything else 
remains unchanged. 

This scheme offers flexibility for the correction of large numbers of 
small errors. Proper initial error correction encoding of the original in- 
formation digits will permit correction of a small number of large errors. 

Table XVI shows some typical cases of the correction of many small 
errors In a decimal message as a function of the number of information 
and check digits in a message of constant length. For convenience, every- 
thing is shown in equivalent decimal digits, even though in the actual 
code, binary and quinary information digits are used. Only the first few 
entries are considered, since the message composed exclusively of the 
digits 0, 3, 6, 9 in which any number of small errors in a decimal channel 
may be corrected (this code is described by the first entry of Table V) is 
more efficient than the codes corresponding to subsequent entries on 
Table XVI. This code, which is very easy to instrument, will transmit 
the equivalent of 77 decimal digits in a 128 decimal digit message. 

One problem not efficiently solved by these techniques is the multiple- 
error correction ternary channel problem. A technique which can be 
used is a code identical to the regular binary Reed-Muller Code, except 
that all equations will be modulo 3 instead of modulo 2. In decoding, this 
will sometimes require subtraction instead of addition; in modulo 2 
equations there is no difference between these operations, but in modulo 3 
equations, the two operations are distinct. The same procedure can be 
used for correcting multiple unrestricted errors in any base. 


VII. ITERATIVE CODES 


All the codes described above have one disadvantage; occasional ex- 
cessive noise will yield a non-correctable message. In order to approach 
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error free transmission, some iterative coding procedure may be used: 
This problem has been solved by Elias.!® His methods are directly appli- 
cable to non-binary codes, since nothing restricts the digits to ety 
values. 

In order to minimize the complexity of an iterative coding precedes 
systematic codes are desirable. The advantages of the Reed-Muller code 
are significant however, especially for the case of a relatively noisy. 
channel. A sound procedure for a binary channel would therefore be to 
use SERM codes, (see Fig. 3); such codes are more efficient than iterated 
Hamming Codes in a relatively noisy channel. : 


VIII. SUMMARY AND ANALYSIS 


Many codes have been presented in this paper, all constructed by 
some combination of procedures involving linear congruence or modulo 
equations. 

In most cases, more efficient codes exist. Exhaustive procedures exist 
for deriving maximum efficiency codes, although the codes derived in 
this manner usually require an extensive codebook, both at the encoder 
and at the decoder. Even for simple single error correction binary codes, 
the most efficient code is not always a systematic code. For example, 
the best systematic single error correction binary code working with an 
eight digit message has only 16 different allowable messages; it is known"™ 
that a non-systematic code with at least 19 allowable messages exists. 

In the case of non-binary codes, the situation is somewhat worse. 
Very few of the codes given in this paper take advantage of the fact that, 
for most situations, a digit that is incorrectly received as 0 or b — I is 
usually corrected only in one direction and no need exists to specify 
whether the correction is +1. Most of the codes are arranged so that 
any received digit may be corrected either positively or negatively. No 
codes have been found which take full advantage of such a property, 
other than codebook codes, except for isolated instances of short message 
codes having symmetrical properties. For example, the single digit, 
single small error correction decimal code having 0, 3, 6, 9 as the allow- 
able messages takes full advantage of this property, and is, at the same 
time, a true semi-systematic code. 

It is extremely difficult to find the ultimate limits df efficiency of code- 
book codes. The exhaustive procedures are totally impractical except for 
very short messages. If an analysis is restricted to codes which do not 
take advantage of the property that certain values of digits may be 
corrected in only one direction, and it is assumed that each possible 
message 1s mutilated to the same number of incorrect messages, one 
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limit to the efficiency of codes may be found. This limit can be derived 
from the fact that an error correction code decoder and correction cir- 
cuit must be able to convert any message which contains errors within 
the bounds of the correction performed by the code, into the value of 
the message as originally transmitted, or must be able to derive the 
original information which was fed into the encoder. Thus, if each mes- 
sage may be mutilated in w ways, and still be corrected, then at least w 
messages must be associated with each allowed message. This is indi- 
cated diagrammatically in Fig. 4. The messages produced by the encoder 
are shown at the left; each one fans out to w — 1 mutilated messages 
plus the original message. The decoder converts any of these w messages 
into the original message. 

The value of w can be determined by taking all possible combinations 
of errors that can be corrected by a coding system. For example, for a 
code system which can correct up to (d — 1)/2 small errors in different 
— digits in an n digit message, w is given by 
(d—1)/2 


w= Dd) OF 2, (55) 


1=0 
where d is the minimum distance between messages, and 


n\ 
(n — t)!2! 

This equation merely signifies that w is the sum of all combinations of 

positive and negative (accounting for the 2 term) errors in up to 

(d — 1)/2 different digits out of n digits. For single errors, w = 2n + 1. 
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Fig. 4 — Graphical representation of an error correction code. 
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The number of different messages that can be produced by the en- 
coder must be no greater than b"/w, subject to the above restriction, 6” 
representing the maximum number of messages that the decoder may 
receive as an input. If only systematic and semi-systematic codes are 
considered, the number of messages is limited to multiples of powers of 
b and of the information component base @ of mixed digits. The number 
of check states must be at least as large as w, so that w different correc- 
tors may be calculated and associated with w different corrections. 

Subject to the above restrictions, the following statements may be 
made. | 

1. The systematic single small error correction codes derived using 
the rules of Section III are the most efficient systematic single small 
error correction codes possible. For those codes in which the two sides 
of inequality (8a) are equal, no code, not even a non-systematic code, is 
more efficient. 

2. The systematic single unrestricted error correction codes derived 
using the rules of Section 4.1 are the most efficient systematic single 
unrestricted error correction codes. For those codes in which the two 
sides of inequality (31) are equal, no code is more efficient. 

3. No codes are more efficient than those semi-systematic codes, 
derived using the rules of Section III, for which the two sides of in- 
equalities (28) and (29) are equal and m, = 0. It is difficult to make 
more general statements about semi-systematic codes, because spe- 
cial techniques (such as those of Section VJ), not all of which are known, 
may be used with these codes. 

For multiple error correction codes, other techniques are bea simpler 
and more efficient than the straight systematic and semi-systematic 
techniques described in Sections JII, IV and V. One such scheme has 
been described in detail in Section VI. No codes have been found which 
approach the limit set by w, but the codes described in Section 6.2 are 
moderately efficient. 

Throughout this paper, all techniques which involve vast complica- 
tions at the expense of slight additional efficiency have been avoided. 
Codebook methods are always possible. If a technique is almost as com- 
plicated as a codebook technique with only slightly greater efficiency 
than a simple technique, the simple technique would always be used in 
practice, and the codebook satisfies the mathematical and theoretical 
requirements. In a sense, a really complicated technique is only useful 
for deriving a better lower limit for the maximum efficiency of a code- 
book code. In the non-binary case, however, a codebook system is con- 
siderably more efficient than any code system which does not take ad- - 
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vantage of the fact that all transmitted messages are not mutilatable to 
an equal number of correctable received messages. : 

Irom the point of view of deriving lower limits to the maximum effh- 
ciency of a codebook technique, such a consideration is vital. Except for 
a few relatively trivial cases, no codes have been found which take sig- 
nificant advantage of the above consideration, for deriving such a 
limit.* 


IX. CONCLUSION 


In this paper, techniques have been presented for deriving error cor- 
rection codes for non-binary systems. None of the methods presented 
are overly complicated, nor do they require excessive storage capacity 
for either the encoding or decoding and correction system. 

The codes are sufficiently simple so that their use with a non-binary 
storage system may be considered, and the development of such a 
storage system should not be stopped because a system without flaws or 
not subject to noise cannot be realized. 

An important disadvantage of using error correction codes with such 
a system is the time requirement. Correction usually requires a signifi- 
cant amount of time. This is probably one reason why the Hamming 
Code is not used more extensively. The more advanced and complicated 
codes, such as the Reed-Muller Codes, suffer particularly from the 
amount of time required for a correction. The codes described in this 
paper are therefore probably best suited to medium or low speed stor- 
ages, which are not read too frequently. 

A study of this type may be of some interest to those who have been 
considering the use of multi-state devices for building switching systems 
and computers, since this paper represents a study of a typical problem. 
Certain lessons may be derived from this study: 

1. Restriction to a single number base for all operations 1s a severe 
handicap. The more advanced codes presented in this paper, require 
extensive use of different number base operations. The ability, inside 
the computer, to change number bases for different operations, may well 
be useful. 

2. Different problems are best solved using different number bases. 
For example, the use of an even number base is desirable for multiple 
small error correction codes, while the use of a prime number base 1s 
desirable for correcting single large errors. It is the author’s opinion that 

* Note that this restriction has less significance in the case of binary codes. In 


a symmetrical channel with only two available signals, each value of a digit may 
be changed in as many ways, namely, one, as every other. 
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number bases which are the product of several small factors are best. 
Suggested values are six, ten and twelve. Number bases with two differ- 
ent prime factors, may offer an advantage, since they permit simple 
translation and change of number base among at least three different 
numbers. 

In the comparison between binary and non-binary error correction 
codes, the following observations may be made: 

1. Keeping the amount of information per message fixed, a binary 
single error correction code is less efficient than a non-binary single 
small error correction code, provided b, the channel base, is greater than 
three, but is more efficient than a non-binary single unrestricted error 
correction code. 

2. Non-binary codes are slightly more complicated to implement than 
binary codes; this applies to multiple error correction codes as well as to 
single error correction codes. The amount of added complication is in no 
case really extensive. 

It was initially hoped that this study might also produce some addi- 
tional binary error correction techniques. One such technique was dis- 
covered: the use of a systematic equivalent Reed-Muller code to ap- 
proach error free coding (see Section VIT). 

Finally, the author wishes to express the hope that further work on 
non-binary systems will be encouraged by this study. 
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Shortest Connection Networks 
And Some Generalizations 
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The basic problem considered is that of interconnecting a given set of 
terminals with a shortest possible network of direct inks. Simple and prac- 
tical procedures are given for solving this problem both graphically and 
computationally. It develops that these procedures also provide solutions 
for a much broader class of problems, containing other examples of practical 
interest. 


I. INTRODUCTION 


A problem of inherent interest in the planning of large-scale communi- 
cation, distribution and transportation networks also arises in connec- 
tion with the current rate structure for Bell System leased-line services. 
It is the following: 

Basic Problem — Given a set of (point) terminals, connect them by a 
network of direct terminal-to-terminal links having the smallest possible 
total length (sum of the link lengths). (A set of terminals is ‘‘connected,”’ 
of course, if and only if there is an unbroken chain of links between every 
two terminals in the set.) An example of such a Shortest Connection Net- 
work is shown in Fig. 1. The prescribed terminal set here consists of 
Washington and the forty-eight state capitals. The distances on the par- 
ticular map used are accepted as true. 

Two simple construction principles will be established below which 
provide simple, straight-forward and flexible procedures for solving the 
basic problem. Among the several alternative algorithms whose validity 
follows from the basic construction principles, one is particularly well 
adapted for automatic computation. The nature of the construction 
principles and of the demonstration of their validity leads quite naturally 
to the consideration, and solution, of a broad class of minimization prob- 
lems comprising a non-trivial abstraction and generalization of the basic 
problem. This extended class of problems contains examples of practical 
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Fig. 1 — Example of a shortest connection network. 
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interest in quite different contexts from those in which the basic prob- 
lem had its genesis. 


II. CONSTRUCTION PRINCIPLES FOR SHORTEST CONNECTION NETWORKS 


In order to state the rules for solution of the basic problem concisely, 
it 18 necessary to introduce a few, almost self-explanatory, terms. An 
isolated terminal is a terminal to which, at a given stage of the construc- 
tion, no connections have yet been made. (In Fig. 2, terminals 2, 4, and 
9 are the only isolated ones.) A fragment is a terminal subset connected 
by direct links, between members of the subset. (In Fig. 2, 8-3, 1-6-7-5, 
5-6-7, and 1-6 are some of the fragments; 2-4, 4-8-3, 1-5-7, and 1-7 are 


2 
a O 


5 


Fig. 2 — Partial connection network. 


not fragments.) The distance of a terminal from a fragment of which it 
is not an element is the minimum of its distances from the individual 
terminals comprising the fragment. An tsolated fragment is a fragment 
to which, at a given stage of the construction, no external connections 
have been made. (In Fig. 2, 8-3 and 1-6-7-5 are the only isolated frag- 
ments.) A nearest neighbor of a terminal is a terminal whose distance 
from the specified terminal is at least as small as that of any other. A 
nearest neighbor of a fragment, analogously, is a terminal whose distance 
from the specified fragment is at least as small as that of any other. 

The two fundamental construction principles (P1 and P2) for shortest 
connection networks can now be stated as follows: 

Principle 1— Any isolated terminal can be connected to a nearest 
neighbor. | 

Principle 2— Any isolated fragment can be connected to a nearest 
neighbor by a shortest available link. 

For example, the next steps in the incomplete construction of Fig. 2 
could be any one of the following: 

(1) add link 9-2 (PI applied to Term. 9) 
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(2) add link 2-9 (P1 applied to Term. 2) 

(8) add link 4-8 (P1 applied to Term. 4) 

(4) add link 8-4 (P2 applied to frag. 3-8) 

(5) add link 1-9 (P2 applied to frag. 1-6-7-5). | 
One possible sequence for completing this construction is: 4-8 (P1), 8-2 
(P2), 9-2 (P1), and 1-9 (P2). Another is: 1-9 (P2), 9-2 (P2), 2-8 (P2), 
and 8-4 (P2). 

As a second example, the construction of the network of Fig. 1 could 
have proceeded as follows: Olympia-Salem (P1), Salem-Boise (P2), Boise- 
Salt Lake City (P2), Helena-Boise (P1), Sacramento-Carson City (P1), 
Carson City-Boise (P2), Salt Lake City-Denver (P2), Phoenix-Santa Fe 
(P1), Santa Fe-Denver (P2), and so on. 

The kind of intermixture of applications of P1 and P2 demonstrated 
here is very efficient when the shortest connection network is actually 
being laid out on a map on which the given terminal set is plotted to 
scale. With only a few minutes of practice, an example as complex as 
that of Fig. 1 can be solved in less than 10 minutes. Another mode of 
procedure, making less use of the flexibility permitted by the construc- 
tion principles, involves using P1 only once to produce a single frag- 
ment, which is then extended by successive applications of P2 until the 
network is completed. This highly systematic variant, as will emerge 
later, has advantages for computer mechanization of the solution proc- 
ess. As applied to the example of Fig. 1, this algorithm would proceed 
as follows if Sacramento were the indicated initial terminal: Sacramento- 
Carson City, Carson City-Boise, Boise-Salt Lake City, Boise-Helena, 
Boise-Salem, Salem-Olympia, Salt Lake City-Denver, Denver-Cheyenne, 
Denver-Santa Fe, and so on. 

Since each application of either Pl or P2 reduces the total number 
of isolated terminals and fragments by one, it is evident that an N-ter- 
minal network is connected by N-1 applications. 


III. VALIDATION OF CONSTRUCTION PRINCIPLES 


The validity of Pl and P2 depends essentially on the establishment 
of two necessary conditions (NCI and NC2) for a shortest connection 
network (SCN): 

Necessary Condition 1— Hvery terminal in a SCN is directly con- 
nected to at least one nearest neighbor. 

Necessary Condition 2 — Kvery fragment in a SCN is connected to at 
least one nearest neighbor by a shortest avaclable path. 

To simplify the argument, it will at first be assumed that all distances 
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between terminals are different, so that each terminal or fragment has 
a single, uniquely defined, nearest neighbor. This restriction will be 
removed later. 

Consider first NC1. Suppose there is a SCN for which it is untrue. 
Then [Fig. 3(a)| some terminal, ¢, in this network is not directly joined 
to its nearest neighbor, n. Since the network is connected, ¢ is necessarily 
joined directly to some one or more terminals other than n — say f,, 
--+,f,. For the same reason, n 1s necessarily joined through some chain, 
C’, of one or more links to one of fi , ---, fy — say to f; . Now if the link 
¢ — f,, isremoved from the network and the link t — n is added [Fig. 3(b)], 
the connectedness of the network is clearly not destroyed — f, being 
joined to ¢ through n and C, rather than directly. However, the total 
length of the network has now been decreased, because, by hypothesis, 
¢ — nisshorter than? — f, . Hence, contrary to the initial supposition, the 
network contradicting NCI could not have been the shortest, and the 
truth of NC1 follows. From NC1 follows Pl, which merely permits the 
addition of links which NCI shows have to appear in the final SCN. 

Turning now to NC2, the above argument carries over directly if ¢ 
is thought of as a fragment of the supposed contradictory SCN, rather 
than as an individual terminal — provided, of course that the ¢ — n link 
substituted for ¢ — f;, is the shortest link from n to any of the terminals 
belonging to ¢. From the validity of NC2 follows P2 — again the links 
whose addition is permitted by P2 are all necessary, by NC2, in the 
final SCN. 

The temporary restrictive assumption that no two inter-terminal 
distances are identical must now be removed. A reappraisal of the 
proofs of NCI and NC2 shows that they are still valid if n is not the . 
only terminal at distance ¢ — n from t, for in the supposedly contradictory 
network the distance ¢ — f, must be greater than t — n. What remains to be 
established is that the length of the final connection network resulting 





(b) 


Fig. 3 — Schematic demonstration of NCI. 
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from successive applications (V — 1 for N terminals) of Pland P2 is 
independent of which nearest neighbor is chosen for connection at a _ 
stage when more than one nearest neighbor to an isolated terminal or 

t is available. This is a consequence of the following considera- 
tions: For a prescribed terminal set there are only a jfinete number of 
connection networks (certainly fewer than Cy‘4”” — the number of 
distinct waysof choosing VN — 1 links from the total of N(NV — 1)/2 possible 
links). The length of each one of this finite set of connection networks is 
a continuous function of the individual interterminal distances, d;; (as a 
matter of f ct, it is a linear function with coefficients 0 and 1). With 
the d,; specified, the length, L, of a shortest connection network is 
simply the smallest length in this finite set of connection network 
lengths. Therefore L is uniquely determined. (It may, of course, be 
associated with more than one of the connection networks.) Now, if at 
each stage of construction employing P1 and P2 at which a choice is to 
be made among two or more nearest neighbors m1, :--, n, of an isolated 
terminal (or fragment) ¢, a small positive quantity, e, is subtracted from 
any specific one of the distances din, , ++, din, — Say from din, — the 
construction will be uniquely determined. The total length, L’, of the 
resulting SCN for the modified problem will now depend on e, as well 
as on the d;; of the original terminal set. The dependence on e will be 
continuous, however, because the minimum of a finite set of continuous 
functions of ¢e (the set of lengths of all connection networks of the modi- 
fied problem) is itself a continuous function of «. Hence, as ¢ is made 
vanishingly small, L’ approaches L, regardless of which “nearest neigh- 
bor” links were chosen for shortening to decide the construction. 


IV. ABSTRACTION AND GENERALIZATION 


In the examples of Figs. 1 and 2, the terminal set to be connected was 
represented by points on a distance-true map. The decisions involved 
in applying P1 and P2 could then be based on visual judgements of 
relative distances, perhaps augmented by application of a pair of di- 
viders in a few close instances. These direct geometric comparisons can 
of course, be replaced by numerical ones if the inter-terminal distances 
are entered on the terminal plot, as in Fig. 4(a). The application of P1 
and P2 goes through as before, with the relevant ‘“‘nearest neighbors’”’ 
determined by a comparison of numerical labels, rather than by a 
geometric scanning process. For example, Pl applied to Terminal 5 of 
Fig. 4(a) yields the Link 5-6 of the SCN of Fig. 4(b), because 4.6 is 
less than 5.6, 8.0, 8.5, and 5.1, and so on. 
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When the construction of shortest connection networks is thus reduced 
to processes involving only the numerical distance labels on the various 
possible links, the actual location of the points representing the various 
terminals in a graphical representation of the problem is, of course. 
inconsequential. The problem of Fig. 4(a) can just as well be represented 
by Fig. 5(a), for example, and Pl and P2 applied to obtain the SCN 
of Fig. 5(b). The original metric problem concerning a set of points in 
the plane has now been abstracted into a problem concerning labelled 
graphs. The correspondence between the terminology employed thus 
far and more conventional language of Graph Theory is as follows: 

terminal <> vertex 

possible link <> edge 

length of link — “length” (or “‘weight’’) of edge 

connection network <> spanning subgraph 

(without closed loops) <> (spanning subtree) 





L=17.6 3) 
(a) (b) 


Fig. 4 — Example of a shortest spanning subtree of a complete labelled graph. 





L176 


(a) (b) 


Fig. 5 — Example of a shortest spanning subtree of a complete labelled graph. 
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shortest connection network <> shortest spanning subtree 

SCN — SSS 
It will be useful and worthwhile to carry over the concepts of ‘‘fragment”’ 
and “nearest neighbor” into the graph theoretic framework. P1 and P2 
can now be regarded as construction principles for finding a shortest 
spanning subtree of a labelled graph. 

In the originating context of connection networks, the graphs from 
which a shortest spanning subtree is to be extracted are complete graphs; 
that is, graphs having an edge between every pair of vertices. It is 
natural, now, to generalize the original problem by seeking shortest 
spanning subtrees for arbitrary connected labelled graphs. Consider, for 
example, the labelled graph of Fig. 6(a) which is derived from that of 
Fig. 5(a) by deleting some of the edges. (In the connection network 
context, this is equivalent to barring direct connections between certain 
terminal pairs.) It is easily verified that NC1 and NC2, and hence Pl 
and P2, are valid also in these more general cases. For the example of 
Fig. 6(a), they yield readily the SSS of Fig. 6(b). 

As a further generalization, it 1s not at all necessary for the validity 
of Pl and P2 that the edge “lengths” in the given labelled graph be 
derived, as were those of Figs. 4-6, from the inter-point distances of 
some point set in the plane. P1 and P2 will provide a SSS for any con- 
nected labelled graph with any set of real edge “lengths.” The “lengths’’ 
need not even be positive, or of the same sign. See, for example, the 
labelled graph of Fig. 7(a) and its SSS, Fig. 7(b). It might be noted in 
passing that this degree of generality 1s sufhcient to include, among 
other things, shortest connection networks in an arbitrary number of 
dimensions. 

A further extension of the range of problems solved by Pl and P2 
follows trivially from the observation that the maximum of a set of 





L=23.8 
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Fig. 6 — Example of a shortest spanning subtree of an incomplete labelled graph. 
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real numbers is the same as the negative of the minimum of the negatives 
of the set. Therefore, Pl and P2 can be used to construct a longest 
spanning subtree by changing the signs of the “lengths” on the given 
labelled graph. Fig. 8 gives, as an example, the longest spanning subtree 
for the labelled graph of Figs. 4(a) and 5(a). 

It is easy to extend the arguments in support of NCi and NC2 from 
the simple case of minimizing the sum to the more general problems of 
minimizing an arbitrary increasing symmetric function, or of maximizing 
an arbitrary decreasing symmetric function, of the edge ‘‘lengths” of a 
spanning subtree. (A symmetric function of n variables is one whose 
value is unchanged by any interchanges of the variable values; e.g., 


ty tate tes + +4, %1X2 °° + X,,8In 2a, + sin 2%. +--+ + sin 2z,, 
(xP + ve + +--+ + 2,3)", etc.) From this follow the non-trivial generali- 
zations. 


‘The shortest spanning subtree of a connected labelled graph 
also minimizes all increasing symmetric functions, and maxi- 
mizes all decreasing symmetric functions, of the edge “lengths.” 





(a) (b) 


Fig. 7 — Example of a shortest spanning subtree of a labelled graph with 
some “lengths’’ negative. 





Fig. 8 — The longest spanning subtree of the labeled graph of Figs. 4(a) and 
5(a). | 
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The longest spanning subtree of a connected labelled graph 
also maximizes all increasing symmetric functions, and mini- 
mizes all decreasing symmetric functions, of the edge “lengths.” 


For example, with positive ‘“‘lengths’”’ the same spanning subtree that 
minimizes the sum of the edge ‘“‘lengths” also minimizes the product and 
the square root of the sum of the squares. At the same time, it maximizes 
the sum of the reciprocals and the product of the are cotangents. 

It seems likely that these extensions of the original class of problems 
soluble by Pl and P2 contain many examples of practical interest in 
quite different contexts from the original connection networks. A not 
entirely facetious example is the following: A message is to be passed 
to all members of a certain underground organization. Each member 
knows some of the other members and has procedures for arranging a 
rendezvous with anyone he knows. Associated with each such possible 
rendezvous — say between member ‘2’? and member ‘7’? — is a certain 
probability, p:;, that the message will fall into hostile hands. How is 
the message to be distributed so as to minimize the over-all chances of 
its being compromised? If members are represented as vertices, possible 
rendezvous as edges, and compromise probabilities as “length” labels 
in a labelled graph, the problem is to find a spanning subtree for which 
1 — II(1 — p,;) is minimized. Since this is an increasing symmetric 
function of the p;;’s, this is the same as the spanning subtree minimiz- 
ing D> p;; , and this is easily found by P1 and P2. 

Another application, closer to the original one, is that of minimizing 
the lengths of wire used in cabling panels of electrical equipment. Re- 
strictions on the permitted wiring patterns lead to shortest connection 
network problems in which the effective distances between terminals 
are not the direct terminal-to-terminal distances. Thus the more general 
viewpoint of the present section 1s applicable. 


V. COMPUTATIONAL TECHNIQUE 


Return now to the exemplary shortest connection network problem 
of Figs. 4(a) and 5(a) which served as the center for discussion of the 
arithmetizing of the metric factors in the Basic Problem. It is evident 
that the actual drawing and labelling of all the edges of a complete 
graph will get very cumbersome as the number of vertices increases — 
an N-vertex graph has (1/2)(N? — N) edges. For large N, it is convenient 
to organize the numerical metric information in the form of a distance 
table, such as Fig. 9, which is equivalent in content to Fig. 4(a) or Fig. 
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5(a). (A distance table can also be prepared to represent an incomplete 
labelled graph by entering the length of non-existent edges as «.) 
When it is desired to determine a shortest connection network directly 
from the distance table representation — either manually, or by machine 
computation — one of the numerous particular algorithms obtainable 
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5.7 | 5.6 | 3.6 
(3) | (1) | () 





Fig. 10 — Illustrative computational determination of a shortest connection 
network. 
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by restricting the freedom of choice allowed by P1 and P2 is distinctly 
superior to other alternatives. This variant is the one in which P1 is 
used but once to produce a single isolated fragment, which is then ex- 
tended by repeated applications of P2. 

The successive steps of an efficient computational procedure, as ap- 
plied to the example of Fig. 9, are shown in Fig. 10. The entries in the 
top rows of the successive /’ tables are the distances from the connected 
fragment to the unconnected terminals at each stage of fragment growth. 
The entries in parentheses in the second rows of these tables indicate 
the nearest neighbor in the fragment of the external terminal in question. 
The computation is started by entering the first row of the distance 
table into the Ff table (to start the growing fragment from Terminal 1). 
A smallest entry in the F table is then selected (in this case, 2.8, asso- 
ciated with External Terminal 4 and Internal Terminal 1). The link 1-4 
is deleted from the F table and entered in the Solution Summary (Tig. 
11). The remaining entries in the first stage ¥ table are then compared 
with the corresponding entries in the ‘‘4” row of the distance table 
(reproduced beside the first F’ table). If any entry of this ‘‘added ter- 
minal’’ distance table is smaller than the corresponding IF’ table entry, 
it.is substituted for it, with a corresponding change in the parenthesized 
index. (Since 3.4 1s less than 5.2, the 3 column of the / table becomes 
3.4/(4).) This process is repeated to yield the list of successive nearest 
neighbors to the growing fragment, as entered in Tig. 11. The F and 
“added terminal” distance tables grow shorter as the number of un- 
connected terminals is decreased. 

This computational procedure is easily programmed for an automatic 
computer so as to handle quite large-scale problems. One of its advan- 
tages is its avoidance of checks for closed cycles and connectedness. 
Another is that it never requires access to more than two rows of distance 
data at a time—no matter how large the problem. 


SOLUTION SUMMARY 
LINK LENGTH 








1-4 2.8 
4-3 3.4 
1-6 3.6 
o-—2 3.2 
ees 4.6 


—_ 


Fig. 11 — Solution summary for computation of. Fig. 10. 
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VI. RELATED LITERATURE AND PROBLEMS 


J. B. ISruskal, Jr.t discusses the problem of constructing shortest 
spanning subtrees for labelled graphs. He considers only distinct and 
positive sets of edge lengths, and is primarily interested in establishing 
uniqueness under these conditions. (This follows immediately from NC1 
and NC2.) He also, however, gives three different constructions, or 
algorithms, for finding SSS’s. Two of these are contained as special 
cases in Pl] — P2. The third is — “Perform the following step as many 
times as possible: Among the edges not yet chosen, choose the longest 
edge whose removal will not disconnect them’? While this is not directly 
a special case of Pl — P2, it is an obvious corollary of the special case 
in which the shortest of the edges permitted by Pl — P2 1s selected at 
each stage. Kruskal refers to an obscure Czech paper’ as giving a con- 
struction and uniqueness proof inferior to his. 

The simplicity and power of the solution afforded by Pl and P2 for 
the Basic Problem of the present paper comes as something of a surprise, 
because there are well-known problems which seem quite similar in 
nature for which no efficient solution procedure is known. 

One of these is Steiner’s Problem: Find a shortest connection network 
for a given terminal set, with freedom to add additional terminals 
wherever desired. A number of necessary properties of these networks 
are known’ but do not lead to an effective solution procedure. 

Another is the T'raveling Salesman Problem: Find a closed path of 
minimum length connecting a prescribed terminal set. Nothing even 
approaching an effective solution procedure for this problem is now 
known (early 1957). 
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A Network Containing a Periodically 


Operated Switch Solved by 


Successive Approximations 


By C. A. DESOER 


(Manuscript received June 15, 1956) 


This paper concerns itself with the analysis of a type of periodically 
switched network that might be used in time multiplex systems. The econom- 
ics of the situation require that the ratio of the switch closure tume 7 to the 
switching period T be small. Using this assumption, the analysis is performed 
by successive approximations. More precisely the zeroth approximation to 
the transmission ts obtained from a block diagram analogous to those used 
in sampled servomechanisms. From the convergence proof of the successive 
approximation scheme, tt follows that when r/T is small, the zeroth approxt- 
mation ts very close to the exact transmission. A discussion of some examples 
as included. 
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I. INTRODUCTION 


One main contributor to the cost of transmission circuits is the trans- 
mission medium itself. Thus it is important to share the transmission 
medium among as many messages as possible. One possible method is 
the frequency multiplex where each message utilizes a different frequency 
band of the whole band available in the medium. An alternate method 
is the time multiplex where each message is assigned a time slot of dura- 
tion 7 and has access to that time slot once every 7’ seconds. It is obvious 
that the economics of the situation requires that 7 be as small as possible 
and T as large as possible so that the largest possible number of messages 
are transmitted over the medium. For this very reason the analysis of 
periodically switched networks is of special interest in the case where 
7/T is small. 

W.R. Bennett* has published an exact analysis of this problem without 
any restrictions either on the network or on the ratio 7/7’. It is believed, 
however, that the analysis presented in this paper will, in most practical 
cases, give the desired answer with a considerable reduction in the 
amount of calculations. The simplification of the analysis is mainly a 
result of the assumption that 7/T' is small. 

First the successive approximation method of solution will be discussed 
in general terms. Next it will be shown that the zeroth approximation 
to the transmission through the network can be obtained from the gain 
of a block diagram analogous to those used in the analysis of sampled 
servomechanisms. The nature of the zeroth approximation is further 
clarified by some general discussion and some examples. Next it 1s shown 
that the successive approximations converge. The convergence proof then 
suggests some slight modifications of the block diagram to obtain a more 
accurate solution. 


II. DESCRIPTION OF THE SYSTEM 


The system under consideration is shown on Fig. 1. It consists of two 
reactive networks N; and Nz connected through a switch S which is it- 
self in series with an inductance ¢. N; is driven at its terminal pair (1) 


N, Ir No 
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Fig. 1 — System under consideration. 
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Fig. 2 — Resonant circuit. 


by a current source J») which is shunted by a one ohm resistor. N2 is also 
terminated at its terminal pair (1) by a one ohm resistor R, which is the 
load resistor of the system. The switch S is periodically closed for a dura- 
tion 7. The switching period is 7’. Thus if the switch is closed during the 
interval (0, 7) it will be closed during the intervals (n7T, nJT + 7) for 
n = 1, 2, 3,---. The inductance ¢ is selected so that the series circuit 
shown on Fig. 2 has a resonant frequency f, = 1/27; i.e., the time 7 
during which the switch is closed is exactly one-half period of the circuit 
of Fig. 2. | 

The switch S acts as a sampler and, as a result of the well-known modu- 
lating properties of sampled systems, the sampling period 7’ must be 
chosen such that the frequency 1/27 is larger than any of the frequencies 
present in the signals generated by Jp . Furthermore, in order to eliminate 
all the sidebands generated by the switching, Nz must have a high in- 
sertion loss for all frequencies above 1/27 cps. 

In the analysis that follows networks NV; and N»2 will be assumed to be 
identical: it should, however, be stressed that this assumption 1s not 
necessary for the proposed method of analysis.* This assumption is 
made because in the practical situation which motivated this analysis 
N, and N2 were identical since transmission in both directions was re- 
quired. | 

In order for the system under consideration to achieve the maximum 
degree of multiplexing, the closure time 7 of the switch will be taken as 
small as practically possible and the switching period 7 as large as pos- 
sible (consistent with the bandwidth of the signals to be transmitted). 
Asa result the ratio 7/7’ is very small, of the order of 10 “ or less in prac- 
tical cases. Consequently the resonant frequency f, of the series resonant 
circuit shown on Fig. 2, is many times larger than any of the natural 
frequencies of Ny; and N. 2. 


* The modifications required for the case where N, is not identical to V2 are 
given in Appendix IV. 
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J 
a 


Fig. 3 — System under consideration when N, and WN: are lossless ladders. 
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The problem is to determine the relation between /,, the voltage 
across R, , and Jo. 


Ill. METHOD OF SOLUTION 


Let us first write the equations of the system. Obviously the equations 
will depend on the exact configuration of the networks N; and Ne. For 
simplicity we shall write them for the case where NV, and WN» are dissipa- 
tionless low-pass ladder networks. As will become apparent later this 
assumption is not essential to the argument. What is essential, however, 
is the fact that both Ni and N»2 should start (looking in from the switch) 
with a shunt capacitor C' and a series inductance L, , the element value 
of L, being much larger than ¢. Using a method of analysis advocated 
by T. R. Bashkow,? we obtain, for the network of Fig. 3, the equations: 


dy 


Ly = = — Ri; — Vo + Rr 


dv : ; 
Co Sut 


doy, 
dt 
di, 
dt 
des 
dt 
di, 
dt 
des 
dt 


Cn 


= tn—1 — i 


es = Un — Co (1.a) 


ln — trA(t) (1b) 


u leo — elAGZ) (Le)? kh (1) 


C i,A(t) — 7%,’ (1.d) 


din’ ’ 
Te = = €3 — Un 


dt 





dv : 7 ‘ 
Cn = tn Vi 


dt 





: I, 
dv.’ 
C1 aE = — i’ 


diy’ 
/ -7 
Ty —_ =—- by =- Ryt1 


dt 
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where 


A(t) = > [uit — kT) — ult — kT — 7)), (2) 


k=—oo 
with u(t) = 1 for t>0, and wt) =0 for ¢< 0. 


This system of linear time varying equations may be broken up into 
three sub-systems 7, , Rk and J,. It is this subdivision that suggests a 
successive approximation scheme that will be shown to converge to the 
exact solution. 

The zeroth approximation is obtained as follows: when the switch is 
closed, i.e., A(t) = 1, the resonant current 7, is much larger than the cur- 
rents 7, and 2,’. Thus, during the switch closure time, 7, and 7,’ are neg- 
lected with respect to 7, in (1.b) and (1.d). Hence when A(t) = 1 the 
system R may be solved for 7,(t), eo(¢) and e3(¢) in terms of the initial 
conditions. The resulting function e.(¢) and given function 7o(t) are then 
the forcing functions of the system J,. The other function e3(¢) is the 
forcing function of the system /,. Under these assumptions, the periodic 
steady-state solution corresponding to an applied current a(t) = Ive’ 
is easily obtained. | 

The zeroth approximation will be distinguished by a subscript ‘0’. 
Thus 7,0(¢) 1s the (steady state) zeroth approximation to the exact solu- 
tion 7,(t). 

The first approximation will be the solution of the system (1), pro- 
vided that during the switch closure time the functions 7,(¢) and 7,'(¢) 
in (1.b) and (1.d) are respectively replaced by the known functions 
tno(t) and 2,0’ (é). And, more generally, the (k + 1)th approximation will 
be the solution of (1) provided that during the switch closure time, the 
functions 7,(t) and 7,’(t), in (1.b) and (1.d), are respectively replaced by 
the known solutions for 2z,(é), and 2,/(¢) given by the kth approximation 
It will be shown later that this successive approximation scheme con- 
verges. Let us first describe a simple method for obtaining the zeroth 
approximation. 


IV. THE ZEROTH APPROXIMATION 


4.1 Introduction 


The problem is to obtain the steady-state solution of (1) under the 
excitation a(t) = Ipe**’. Using the approximations indicated above, 
during the switch closure time (that is when A(¢) = 1) the system FR 
becomes 
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des Se te 
oS = sac, (3) 
¢ Sr = [es — alAO, (a) 
cH = 4a. (5) 
at 


Differentiating the middle equation and eliminating de./dt and de3/dt 
we get forO0 S t < 7: 


Ta — Zia’) +5 [ex(t) — e3(t)]8(é) (6) 


in which we used the notation 6(¢) for the Dirac function and the knowl- 


edge that 
dA) _ 


Equation (6) represents the behavior of the resonant circuit of Fig. 2 
for the following initial conditions: 











i(O+) = 0, (8) 
diO+) — e(0) — e3(0) 
dt ¢ ©) 


In Appendix J it is shown that the resulting current 7,(¢) is, for the in- 
tervalO St <7, 


i,(t) = Cle(O) — e3(0)|si(@), (10) 


where 
w . wt 1 : 
|Zsin™ = _w)sinat for Os t<r 
a()=427 or 2 (11) 
lo elsewhere 
with 


i, ee 
ont a 4/2. (12) 


Thus the zeroth approximation to the exact 2,(¢) is given for the interval 
O0OstsTby 


tro(t) = Cle2(0) — e3(0)]si(2). (13) 
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We shall now show that the zeroth approximation may be conveniently 
obtained from the block diagram of Fig. 4. 


4.2 Description of the Block Diagram 


All the blocks of the block diagram are unilateral and their correspond- 
ing transfer functions are defined in the following. Capital symbols repre- 
sent £-transform of the corresponding time functions, thus Jo(p) 1s the 
£-transform of z(t). 

Referring to Fig. 1, 


E2(p) 
[o(p) \1,=0 


Thus Zi2(p) represents the transfer impedance of Ni; when its output is 
open-circuited (i.e., /- = 0). Since N; and N» are identical we also have, 
from R, = 1 and reciprocity, Ze(p) = H.s/I,, where J, 1s the cur- 
rent entering No. 

The impulse modulator is periodically operated every 7’ seconds, 
and has the property that if its input is a continuous function f(é) its 
output is a sequence of impulses: 


os f(t) o(t — kT). 





Zip) = 


The transfer function S,(p) is defined by 


WO 


2 
= ie 0 Oe ad LL 
Sifp) = £&[s,(t)] Saat cosh 5 (14) 





Let Z(p) be the driving point impedance at the terminal pair (2) of N; . 
It is also that of Ne since N; and N2 are assumed to be identical. 

Let V(p) be the output of the first block, then, by definition, V(p) = 
Z12(p)Io . Let v(t) be the corresponding time function. The voltage v(t) 


IMPULSE 
MODULATOR 





OTe. 
ALL BLOCKS ARE UNILATERAL 


Fig. 4 — Zeroth-approximation block diagram. 
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is the output voltage of N;, when JN; is excited by the current source 
Io and the switch S remains open at all times. 


4.3 Analysis of the Block Diagram 


For simplicity, suppose that the system starts from a relaxed condi- 
tion (i.e., no energy stored) at t = 0. Let z(t) = £& ‘[Z(p)]. Considering 
the network JN, as driven by 7% and 7,0 , it follows that the voltage ee(t) 
shown on Fig. 3 is given by 


eo(t) = v(t) — i | to(t’ a(t — t’) dt’. (15) 
Similarly 
ex(t) = i | tro(t a(t — t) dt’. (16) 
Thus 
exo(t) — eo(t) = v(t) — 2 i | tot a(t — t’) dt’. (17) 


These equations have been derived by considering Fig. 1. They could 
have been also derived from the block diagram of Fig. 4 as follows: let 
T,o(p) be the output of C'S,;(p). As a result, the output of the block 
2Z(p) is 2Z(p)I(p). When this latter quantity is subtracted from V(p) 
one gets V(p) — 2Z(p)I0(p), which isthe &-transform of the right-hand 
side of (17). Referring to the block diagram it is also seen that this 
quantity is the input to the impulse modulator. 

Thus we see that if J,o(p) is the output of C’'S,(p), then the input of 
the impulse modulator is és(¢) — e30(¢) by virtue of (17). If this is the: 
case the output of C'S;(p) is given by Cle0(0) — e30(0)|si(t), for O St < T, 
which, according to (9), is z,o(¢). 

Thus the block diagram of Fig. 4 is a convenient way of obtaining the 
zeroth approximation to the periodic steady-state solution. 

In order to use the techniques developed for sampled data systems,» ° 
we introduce the following notation? If f(t) = &’[F(p)], then we define 
F*(p) by the relation 

+o 


F*(p) = ; a F(p + jnws), (18) 


=—00 


where 


oO, = —. | (19) 
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If f(0+) is defined by lim f(e’), then, provided f(0+) = 0,* 
e>0 


rip) = 2 | & fo ae - nt). (20) 
Going back to the system of Fig. 4 we get’ 
j _ [Z12(p) Lo(p)]*CS1(p) Zi2(p) 
Me) = TF 2CSOZOF = 
and 
Trop) _ [Z12(p) Top) ]*CS1(p) (22) 


1+ 2C(Si(p)Z(p)]*’ 
where according to the notation defined by (18) 


Zaalp)Lo(@)I* = 75 Dy Zilp + jnen)Io(p + ine), 
+00 
[Si(p)Z(p)I* = = ay Silp + jnw.)Z(p + jnw,). 


It should be stressed that (21) and (22) are not valid when 7+ is made 
identical to zero. When 7 = 0, Si(p) = 1 for all p’s and since Z(p) ~ 
1/Cp as p — © the time function whose transform is Z(p)S,(p) is differ- 
ent from zero at ¢ = 0. In such a case (20) does not hold. From a physi- 
cal point of view, the feedback loop of Fig. 41s unstable when 7 is identi- 
cally zero since an impulse generated by the impulse modulator 
produces instantaneously a step at the input of the impulse modulator. 
This step causes an instantaneous jump in the measure of the impulse 
at the output of the impulse modulator and so on. In short the feedback 
loop is unstable. 

It should be pointed out that if the power density spectrum of J» is 
zero for frequencies higher than w,/2, (21) reduces to 


Ego 1 CSi(p)Z12 (p) 


Ws 
i Tresor eS 28) 


For certain applications it is convenient to rewrite (21) in a slightly 
: * When f(0), as defined above, is different from zero, (20) should be replaced 
Vy . 


g » sO) a(t — | = F*(p) +5 $+). 


n=0 
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different form. Advancing the time function s,(¢) by 7/2 seconds, one 
gets the function so(é) which is even in ¢. As a result its transform So(p) 
is purely real, that is, 





2 
So(p) = ~~~ eosh ue 
0 P DO Nein 9 


Irom an analysis carried out in detail in Appendix IV we finally obtain 


[Z12 (p) Lo (p)] *So (p) Z12 (p) 
2|Z(p)So(p)]* 


It should be pointed out that (23) 1s still valid when 7 = 0. Equations 
(20) and (28) give the zeroth approximation to the gain of the system 
for any driving current %(t). 

In many cases it is sufficient to know only the steady-state response 
~ Ey(p) to an input i(f) = Ive’“’. The response F(p), as given by (23) 
for (20)] includes both transient and steady-state terms. Since Jp(p) = 


Lig (p) = (24) 





° — equation (24) gives 
P — J®0 
1 > Z Ih 
— 12(p = jnws) te ee So(p)Z12(p) 
Ew(p) = —= ia wi LE Do . (25) 
2[Z(p)So(p)]* 


Since neither So(p) nor Zi2(p) have poles on the imaginary axis, the 
steady state includes only the terms corresponding to the imaginary 
axis poles of the summation terms. Thus the steady-state response is of 
the form 


> An gn 


where, from (25), | 
A, = fob ral jovo) Sof Jeo az Jnw;)Z32( Jao = 7Nws) . (26) 
2 Do Aljoo + jk — n)w.)Soljen + je — no 


V. TRANSMISSION LOSS 


A practically important question is to find out @ prior whether a 
switched filter necessarily introduces some transmission loss. 

The following considerations apply exclusively to the zeroth order 
approximation. It will be shown that assuming ideal elements, the trans- 
mission at de may have as small a loss as desired. 
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By transmission at dc we mean the ratio of the de component of the 
steady state output voltage to the intensity of the applied direct current. 
Thus we refer to (26) and set w) = 0 and n = O. Suppose the lossless 
networks N; and Ne» are designed so that their transfer impedance Z1. 
is of the Butterworth type, that is 


] 


ito 


| Z12( Jw) ' a 


where for our purposes J is a large integer. 
In the following sum, which is the denominator of (26) when wo. = 
n = 0 


oo 
2 D2 Z (jews) So( jks), 


(where w, > 2 since the cutoff of the networks N occurs at w = 1), the 
terms corresponding to values of k ~ 0 will make a contribution that 
vanishes as M — «. This is a consequence of the following facts: 

(a) Re[Z(jkws)| = | Zio(jkw,) |”, since the networks N; and N» are 
dissipationless. Hence for k ~ Oandas M — ~ Re[Z(jku,)| — 0, 

(b) Im[Z(jke.)] = —Im[Z(—jhe.)], 

(c) So(jw) is real. 

Thus the imaginary part of the products Z(jkw,) So(jkw;) cancel out and 
the real part (for k # 0) decreases exponentially to zero as M > o. 
Hence for sufficiently large M the denominator of (26) may be made as 
close to two as desired. 

It is easy to check that the numerator of (26) reduces to Jo , the in- 
tensity of the applied direct current. Therefore the ratio of Ao, the de 
component of the output voltage to J) may be made as close to one-half 
as desired. 


VI. A SIMPLE EXAMPLE 


Since the approximate formulae derived in Section IV are somewhat 
unfamiliar it seems proper to consider in a rather detailed manner a 
simple example.* 

Consider the system of Fig. 5. Assume that the current source applies 
a constant current to the system and assume that the steady state is 
reached. For simplicity let R = E. 

The steady-state behavior of the voltages e(¢) and e3(t) = e,(t) is 


* In addition, the limiting case of the sampling rate — ©, i.e., 7’ — 0, is treated 
in Appendix II. 
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Fig. 6 — Waveforms of the network of Fig. 5. 


shown on Fig. 6. It is further assumed that the duration 7 during which 
the switch is closed is negligible compared to 7’, the interval between 
two successive closures. | 
Let & be the average value of the steady-state voltage e,(¢). Thus é, 
is equal to Ap as given by (26) with w = n = 0, namely, 
Z127(0) So(0) 


4 = if boo 


25> Z(jku.) So(jhen) 


In this particular case 
R -1 
1+ pRC , * 
p+ RC 
Since we assume 7 to be infinitesimal So(p) and Si(p) may be con- 
sidered equal to unity over the band of interest. Using the expansion 


Z(p) = Zu(p) = 
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1 eo 
cothe =~ + Dis ae 
we obtain 
“too C-1 
0 FE oll mF 
“= (p + jnws) T PG 
Hence 


1 fi 
af = C eoth és ‘ 


IR? RC 1 


1S eee a ES 
1 T fi 7. 
T @ coth (sta) coth (ska) 


This last result obtained from the theory developed above is now going 
to be checked directly. Referring to Fig. 6, where the notation is defined, 
and noting the periodicity of the boundary conditions, we get 

(V1 + Nee — V1 ; ; 

i == Vie We CV Pd), 
Noting that e(#) = (Vi + A)e“”°, and solving for Vi and A we fin- 
ally get 


paps 
T pe» Z(jkos 





Thus finally 


(27) 


po tlRe 
es(t) = [ p @atine SaTTRG 


By definition 


or 


This last equation checks with (27). 


VII. NUMERICAL EXAMPLES 


Consider the network of Tig. 7. The cutoff of both Ni and Ne occurs 
at w = 1. In view of the sampling theorem good transmission requires 
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LOSS IN DECIBELS 





0.10 0.15 0.20 0.30 0.40 0.50 0.60 0.80 1.0 


Fig. 7 — Computed transmission loss. 


that the signal be sampled at a rate at least twice as large as its highest 
frequency component. Since the cutoff occurs at w = 1, the sampling 
angular frequency should at least be equal to 2. For illustration purposes 
we have taken w, = 2.67 and w, = 5 for the angular sampling frequency 
The value w, = 2.67 corresponds to a cutoff at 3 ke and a sampling rate. 
of 8 ke. The ratio 7/T was taken to be 1/125. The transmission through 
the switched network as given by the zeroth approximation is shown for 
both cases on Fig. 7. 

As expected the transmittance of the switched filter gets closer to that 
of an ordinary filter as the switching frequency increases. 


VIII. THE SUCCESSIVE APPROXIMATION SCHEME 


The ideas involved in the successive approximation scheme are simple 
and straightforward. One point remains to be settled, namely the con- 
vergence of the procedure. 

We shall assign a subscript 1 to the correction to be applied to the 
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zeroth approximation in order to obtain the first approximation. ‘Thus 
adding 72,:(t) to 2-o(t) we get the first approximation 7,o(¢) + z(t). More 
generally the kth approximation 1s yan: The procedure will con- 
verge if, in particular, the infinite series >) %—oi;n(t) converges. 


8.1 Preliminary Steps | 


(a) Let us normalize the frequency (and consequently the time) so 
that the switching period 7' is unity. Since the networks N,; and Ne must 
have high insertion loss for w > $(27/T) = 7m, the pass band of Ni and 
N» must be the order of 1 radian/sec. As a result the element values of 
the capacitor C and the inductance L, (see Fig. 3) are also 0(1). 

(b) For the excitation 7 = e*’’, the zeroth approximation derived 
above may be written in terms of Fourier components: 


. “to e 
tro(t) = aes > Loge. 
k=— 0 
foo , 
; : 4 
ino(t) a gee > Tee. 
k=—0 


Let 2, denote the complex conjugate of 2,0, then 
+00 
tro(tiro(t) = : » Lictoie 
»{=—00 


Since the functions ¢7""[k = --- —1, 0,1, ---]| are orthonormal over 
the interval (0, 1) and form a complete set,’ we have from Bessel’s equal- 
ity? 

+00 


i Jo Pat = So [Toa = Nn, 


oa OO 


where N(J,0) denotes the norm of the vector J,o which is defined by its 
components Jro.(k = ++: —1,0, +1.--- ). Similarly, 
+00 


1 
| | tno(d) ° dt = p> | Zno,x ° = N(Ino). 


=—00 


(c) Since the switch is periodically closed we shall be interested in 
the Fourier series expansion of A(¢): 


+00 
A(t) = u(t) — u(t — 7) = DS Ane! 
where Ap = 7 and 


+00 
2d | Ax |? = 7. 
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Since 7/7’ «< 1, and since the frequency has been normalized so that 
T = 1, we have 7 < 1. 
Using the convolution in the frequency domain, we have 


| “Feo “foo 
tno(t)A(t) a >» ( Anelut) gan 


k=—w~o \a=—o 
If we introduce the infinite matrix G defined by 
Gin = Aj_x (2, k oe en ak) ates ica 00 ), 


the convolution may be represented by the product, Glo, where Ino is 
the vector whose components are Jno, x(k = -+- —1,0,1, ---). 

(d) Considering the network shown on Fig. 3, let E(p) be the ratio of 
I,/(p) to I,(p). Taking into account the assumed identity between N; 
and N~» it follows that 


tIn(p)| In’ (p) 
[,.(p)  |ro=0 I,(p) 


Using the system of (1) and, for example, by Neumann series expan- 
sion of the inverse matrix, we get 

1 2 
Lp Lect 

(e) Considering now the effect of 7,(¢) and 7,’(t) on 7,(t), (42) of 
Appendix I gives J,(p) as a function of 7,(p) and J,’(p). In the present 
discussion where we are interested in the steady state of 7,(¢) it is es- 
sential to keep in mind that since the switch opens at t = 7, the memory 
of the resonant circuit extends only over an interval 0 < ¢t S 7. To take 
this into account we must modify the factor (wo /2)/(p” + ap’) of (40), 
because the impulse response (which represents this memory) must be 
identically zero for t > 7. The resulting new expression is 


= E(p). 








E(p) = 


nll 
_ 2 —pr/2 p_pr/2 —pr/2 
ae Oe al [e? +e” Is 
or 
2 
PQ) So cosh =. 
(p) a € cosh 


Since the time function whose transform is F'(p) is non-negative for all 
t’s and since F(0) = 1, it follows that 


|F(jw) | S 1. (28) 
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8.2 Matrix description of the successwe approximations 


¥rom the developments of Section IV, we know 2,0(£), ¢no(é) and tno’ (é) 
or what is equivalent, the vectors Jo, Ino and Io’. The first approxi- 
mation takes into account the effect of ¢no(t) and Zn’(t) on 7,(t). [See 
equation (1.c) and (1.d)]. The time functions tn9(t) and tno(t) affect the 
system F only during the interval (0, 7). Therefore we must consider the 
vector G(Ino + Ino’) which corresponds to the excitation of the resonant 
circuit. Since the opening of the switch after a closure time 7 forcibly 
brings 7,(¢) to zero we have 


In = G F GU no +- Fas (29) 


where the matrix G has been defined above and the matrix F is a diago- 
nal matrix whose diagonal elements P, (k = --- —1, 0, +1.---) are 
defined by FP, = F(jw, + j2rk). Note that (28) implies that | F,| S 1 
for all k’s. It should be kept in mind that Jo + J, is the first approxima- 
tion to the exact J,(p). 

The next iteration is obtained by first taking into account the effect 
of J on the rest of the network: 


Int —= i 14 
(30) 
Ta = iE I ) 
where F is a diagonal matrix whose elements i, (k = ---, —1, 0, +1, 


---) are defined by HE, = H(jw, + 22k7), and then the effects of Ine 
and Jn’ on I, , that is, 


Te = GIG (Int -- La). (31) 


combining (30) and (81), [2 = 2G FGEI,. <A repetition of the same 
procedure would lead to J, = 2G FGETI,., and in general [,,41 = 
2GFGET,,.. 

Since the nth approximation to J,(p) is given by the sum > cLol yx , 
the successive approximation scheme will be convergent only if the series 


>» Tek 
k=0 
converges. This will be the case if and only if the series 


1 +2GrFGH+:--+ (2GFGE) 4+ --Un (32) 


converges. 
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8.3 Convergence Proof 


Consider a vector X of bounded norm corresponding to a time func- 
tion x(t) having the property that z(t) = Oforr S¢S T and a(t) #0 
for 0 < t < r. In the above scheme, the vector X would be J,, . Let us 
define the vectors Y, Z, U and V by the relations : 


Y = EX, (33) 
Z = GY, (34) 
U = FZ, (35) 
V = 2GU, (36) 
hence 
V=2GFGEX. (37) 


We wish to show that N(V) S$ aN(X) with a < 1, since these inequali- 
ties imply that the infinite series (82) converges. 

Since (a) N; and N2 are low-pass filters with cutoff S mw radians/sec, 
(b) E(p) = 1 for p = 0, (c) E(p) « 1/L,Cp* for p > 1, only a few of 
the ;’s will be of the order of unity In most cases H_;, Eo, E, will be 
smaller than unity, thus, 

N(Y) S N(X). (38) 


In view of the pulsating character of x(t) the power spectrum of x(¢) 
is almost constant up to frequencies of the order of 7/7 radians/sec. 
Because of the low-pass characteristic of E(p), the function y(t) associ- 
ated with the vector Y is smooth in comparison to x(t), thus from (84), 


n@) =f |2@Pa= [ [yO a = avin), 


where a = Q(1). 
Since | F; | S 1 for all k’s, from (83), N(U) S N(Z), hence N(U) = 
brN(Y) with b = 0(1). | 


N(U) = brN(Y) with 6b = O(1). 
From (36) we have 


N(V) = 2f |u@ dt < 2 [| uto dt = 2N(U). 


Thus we finally get 
N(V) = 2b7N(Y) where. b = O(1), (39) 
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and since t < 1 we get from (88) and (39) N(V) = aN(X) witha < 1. 
Hence the convergence is established. 


IX. A MODIFICATION OF THE BLOCK DIAGRAM TO IMPROVE THE ZEROTH 
APPROXIMATION 


In principle it is possible to obtain a block diagram whose transmission 
characteristic 1s equal to the first approximation. In many cases it is 
not necessary to go that far. The first approximation takes into account 
the effect of the currents 7,0(t) and 79’ (¢) on the resonant circuit of Fig. 2. 
Since during the switch closure time the currents 779 and 79’ cannot vary 
much, let us assume that they remain constant for the duration of the 
switch closure. 

Referring to the analysis of Appendix I and to (42) in particular, we 
see that the current 72, is increased by 


2 . -/ 
. eo Wo tn(O—) = ln (O—) 
bin (p) a P+to O° 
or 


sili) = CAO) = AO) 1 — cos wet 0O<it<r. 


Defining S:(p) = &*{4(1 — cos ant)[u(é) — u(é — 7)]}, or 


2 2 2 
_% 1 _ Op +o) 
Pl) Ga) Gta” 





i-2c{[2(p)s(0]]” + [pz(p) seco] 


E4(p) = 


Fig. 8 — Modified block diagram. 
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and recalling that the input of the impulse modulator of Fig. 4 is 
€2(0) — e3(0), it becomes obvious that the modified block diagram should 
be that given by Tig. 8. The output of the modified block diagram is 
given by? 


By(p) — £122 P)LolPdI*Si(p) + ClpZra(p)La(p)I*S2(p) 


1+ 2C{[Z(p) Sip) * + [eZ (p) Sip)" 


X. CONCLUSION 


Let us compare the method of solution presented above with the more 
formal approach proposed by Bennett. The latter method leads to the 
exact steady-state transmission through a network containing periodi- 
cally operated switches. This method is perfectly general in that it does 
not require any assumption relative to the properties of the network 
nor to the ratio of 7/7’. As expected this generality implies a lot of de- 
tailed computations. In particular it requires, for each reactance of the 
network, the computation of the voltage across it due to any initial con- 
dition. The method presented in this paper is not so general because it 
assumes first that the ratio 7/7’ 1s small; second the value of the induct- 
ance ¢ is very much smaller than that of L, (see Fig. 3). The result 
of these assumptions is that the system of time varying equations 
may be solved by successive approximations with the further advantage 
that the convergence proof guarantees that, for very small 7/7, the 
zeroth approximation will be a close estimate of the exact solution. 

The zeroth approximation may conveniently be obtained by consider- 
ing a block-diagram analogous to those used in the analysis of sampled 
servomechanisms. Further the proposed method leads directly to some 
interesting results, for example, as far as the zeroth approximation is 
concerned, the de transmission may be achieved with as small a loss as 
desired provided the lossless networks N; and N» are suitably designed. 
Another advantage of the proposed method is that the simplicity of the 
analysis permits the designer to investigate at a small cost a large num- 
ber of possible designs. 

Finally it should be pointed out that this approach to the solution of 
a system of time-varying linear differential equations may find applica- 
tions in many other physical problems. 


APPENDIX | 


ANALYSIS OF THE RESONANT CIRCUIT 


Consider the resonant circuit of Fig. 2. Suppose that at ¢ = 0, the left- 
hand capacitor has a potential e.(0) and the right-hand capacitor has 
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a potential e3(0) and that at £ = 0 the current 7, through the inductance 
£18 zero. 
The network equation is 


d. 2 i ; = 
tat + G 1, dt = 0. 
Now 7,(0) = 0 and di,(0)/dt = [e.(0) — e3(0)|/£. Let 2/fC = wy, then 


di,(0)/dt = wy Cle(0) — e3(0)]/2. 
Using Laplace transforms, 





@ + wp) = pi) + EO), 
C 1 oy) 
I,(p) = 2 [e2(0) o e3(0)] De we 
hence | 
5 = at eS cree (41) 
and 


q(t) = [ i,(t) dt = C Se [1 — cos wo]. 


If 2r/wp = 27, ie., 7 = r+/lC/2, which means that the duration of the 
switch closure is a half-period of the resonance of the tuned circuit, then 


i, (ft) = a Cle2(0) oa 63(0)] 3 in®, 


qt) = cs) — e() a es(0)) E — cos =) 


T 


It is clear then that, during the period 7, the charge transferred onto the 
L 
000 


Ie (~) 


CWS In(P)+In(P) 
pe +w2 2 





r(P) = 


Fig. 9 — Resonant circuit excited by current sources J, and I,’ . 
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right-hand capacitor is g(r) = C[e2(0) — e3(0)| and as a result at time 
t = 7 the right-hand capacitor has a voltage e:(0) and the left-hand 
capacitor has a voltage e3(0). Considering now the network of Fig. 9, 
the equation 1s 
di, , 2 
de * @ 


Assuming all initial conditions* to be zero we get, 


i, = Fk + 40). 


a wo. I,(p) =f I, (p) | 
Ip) = ae oF (42) 


APPENDIX II 


STUDY OF THE LIMITING CASE 7’ — 0 


We expect that if the sampling period 7 — 0, which is equivalent to 
stating that the sampling frequency w, > «, then the inductance ¢ — 0 
and as a result the voltage e3(¢) will be infinitely close, at all times, to 
the voltage e.(¢). Thus, in the limit, everything happens as if the termi- 
nal pairs (2) of Ni and N2 were directly connected. In that case the gain 
of the system is 


Z12(p) 
2Z(p) 





Ze (p), 


as is easily seen by referring to the Thevenin equivalent circuit of Ni. 

Let us show that as JT — 0, (21) leads to the same result. First note 
that both ZZ) and ZS; go to zero at least as fast as 1/p’ for p > ©. 
Hence the summations in (21) reduce to the term corresponding to 
n = Q. Therefore, 


C{Z Lo] *S1 CZ 21 ZL 1281 pa Zi 


Ex(p) = Lo 20\Z8i\"" "PL WOLZS, 2 


Io. 


APPENDIX III 
ZEROTH APPROXIMATION IN THE CASE WHERE JN, IS NOT IDENTICAL TO No 


Let, for k = 1, 2; C; be the shunt capacitor at the terminal pair 2 of 
N;, Zx(p) be the driving point impedance of N;, and Zp) be the 
transfer impedance of N;. In the present case the capacitors Ci and C. 
are in series in the resonant circuit of Fig. 2. It can be shown that the 


* Their contribution has been found in (40). 
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charge exchanged during one-half period of the resonance is 


2C1C2 


OG, [e2(0) — e3(0)]. 


Tor the present case, (16) and (17) become 


t 


eer ee [ PE = ae 


t 


e, (1) [ _tro(a)aalt = 1) dr. 


Following the same procedure as before we are finally led to the block 
diagram of Fig. 10 whose output is given by 


2C1C 
70° Dio) ——_— 
[Zr2 (p) o(p)] CoG, 
2C1C2 


L$ OE LAD) + Zal)ISiC—)}* 


Si(p) Zr." (p) 
Dp (p) = 


APPENDIX IV 
THE DERIVATION OF EQUATION (24) 


Considering the method used in Section IV to derive the zeroth ap- 
proximation, it is clear that during the switch closure the voltages e2(¢) 
and e3(¢) vary sinusoidally, that is, — 


e(t) = e(0) — e2(0) — e3(0) c ere st), 


2 T 
€3(t) — e3(0) + eo(0) — €s(0) E — cos eI. 
: 2 rT 
IMPULSE 
MODULATO 





Fig. 10 — Zeroth approximation: modified block diagram for the case where N1 
and N» are not identical. 
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Thus, it always happens that for t = 7/2, 1.e., at the middle of switch 
closure time, é2(t) — e3(t) = 0. 
Therefore if we consider the time function e2(¢) — e3(t) we have for all 


k’s (— co, +>. 0, --- + wo), 


eo (wr + ‘) — €3 (ir ++ 4 = 0. 


If, for simplicity of analysis, we assume that the switch is closed during 
the intervals —(7/2) + kT St S +7/2 + kT, then for all k’s, 


e(kT) — e(kT) = 0. 
Using (17), this condition implies that [V(p)]* — 2[L(p)Z(p)|* = 0. 


Now, remembering that 7,(¢) consists of a sequence of half sine waves 
whose shape is defined by s(t) (which is by definition identical to s,(¢) 
except for an advance in time of 7/2) it follows that I,o(p) = B(p)So(p), 
where B(p) is the &-transform of the sequence of impulses whose measure 
is equal to the charge interchanged between N, and Np» at each switch 
closure. Since [B(p)So(p)Z(p)|* = B(p)[So(p)Z(p)}*, then 


[Z12(p) Io(p)|* 
2[So(p)Z(p)]* 


From which it Immediately follows that 


me p) = = [Z12(p) Io(p)|*So(p) 


B(p) = 


2[So(p)Z(p)|* 
and 
_ [Z12(p) Lo(p) ]*So(p) Z12(p) 
Hol?) = SZ 
where 
+co 
[So(p) Z(p)]* 7b & p + jnws) Z(p + jnws) . 
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Kxperimental Transversal Equalizer for 


TD-2 Radio Relay System 


By B. C. BELLOWS and R. S. GRAHAM 
(Manuscript received February 26, 1957) 


To determine the effect of improved equalization on the performance of 
TD-2 radio relay systems, an experimental adjustable transversal equalizer 
has been developed. The equalizer 1s based on the echo principle as used in 
transversal fillers, and operates in the GO0- to 80-mc frequency band. Seven 
pairs of adjustable leading and lagging echo terms provide flexibility for 
stmultaneous .gain and delay equalization. Directional couplers are used 
for tapping and controlling the echo voltages. Field experiments have shown 
that system equalization can be improved appreciably by the use of such 
equalizers. 


INTRODUCTION 


The TD-2 radio relay system! employs frequency modulation to 
transmit multichannel telephony or television. The frequency modulated 
signal requires a transmission system whose gain and envelope delay 
are constant over the frequency band, 20 me wide, used to transmit 
the signal. Deviations from constant gain or delay result in non-linear 
distortion of the demodulated signal. For television this results in dis- 
torted images, and for multichannel telephone transmission this intro- 
duces cross modulation among the voice channels. 

Basic equalization is provided in each repeater. In addition, certain 
fixed equalizers have been used on a mop-up basis. However, there 
remains some residual distortion of random shape. This paper discusses 
the design of an adjustable equalizer to compensate for this distortion. 


I. BASIC EQUALIZATION 


The transmission path through a TD-2 repeater consists of an inter- 
mediate frequency portion covering the 60- to 80-me band and radio 
frequency channels 2G-mc wide in the 3,700- to 4,200-mc band. At each 


1429 


1430 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


repeater the RF channels are separated by wave guide filters and each 
is demodulated to the II* frequency for amplification and equalization. 
The outgoing signal is then modulated back to an RF channel, at a 
different frequency from the incoming signal to reduce interference. 
The RF and IF portions of the repeater were designed so that the com- 
bination would have flat gain to within the closest practicable limits 
over the 20-mc band, and a number of adjustments are provided in the 
IF amplifier to maintain this flatness under field conditions. The unequa- 
lized repeater, however, has an envelope delay distortion characteristic 
shown in Fig. 1 which is approximately parabolic. To minimize this 
distortion, each repeater contains a 315A equalizer, which has approxi- 
mately the inverse of the delay distortion of a typical repeater. 


Il. SUPPLEMENTARY EQUALIZATION 


In a TD-2 system consisting of many repeaters in tandem, both gain 
and delay distortions may accumulate to the point where additional 
equalization is necessary. If the pass band of the repeaters shifts slightly 
in frequency, due to changes in temperature or adjustment, the differ- 
ence between the repeater delay and the delay of its equalizer will result 
in delay distortion which has approximately a linear slope with fre- 
quency. This may be corrected at main repeater stations by combina- 
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Fig. 1 — Over-all delay distortion of a typical microwave repeater, TD-2 
system. 
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tions of delay slope equalizers. The characteristics of the 319A, B and C 
équalizers provided for this purpose are shown in Fig. 2. Each equalizer 
consists of two bridged-T all-pass sections. There is also some variation 
in bandwidth of the TD-2 repeaters, resulting in part from the fact that 
the waveguide filters used in the higher frequency radio channels are 
somewhat broader than those in the lower frequency channels. This 
variation in bandwidth results in delay distortion which has a parabolic 
shape with frequency. Use of a larger or smaller number of the basic 
315A equalizers corrects this. 

Over long circuits, small distortions in the gain shape of the TD-2 
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Fig. 2 — Delay characteristics of 319 type equalizers. Combinations of these 
are used at main stations to equalize delay slope of the system. 


1432 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


equipment produce a cumulative gain-frequency distortion which is 
noticeable in television circuits. Present practice is to correct for this 
by standard video equalizers after the FM signal has been demodulated 
to baseband. In connection with the experimental equalizing program 
to be described, parabolic gain equalizers operating on the I'M signal 
before demodulation were used. 


IlI, RESIDUAL DISTORTION 


After correction of the known shapes discussed above, there remains 
a certain residual gain and delay distortion which results from a random 
summation of many minor sources. The shape of this distortion is not 
predictable, but its statistics are known. Examination of typical delay 
versus frequency characteristics have shown that these may be reason- 
ably well approximated by six cosine terms: a 40-mc fundamental and 
the next five harmonics. Similar gain terms are needed. However, the 
gain and delay distortion, when examined within the 20-mc band of 
interest, do not have a minimum phase relationship. This is to be ex- 
pected because of the presence in the system of the delay equalizers, 
which are non-minimum-phase networks, and of amplifiers with com- 
pression. 

The magnitude of the residual distortion is small enough so that trans- 
continental TD-2 circuits provide television and telephone transmission 
of commercial quality. Some effects, such as cross modulation, are 
sufficiently marginal so that improvement would be desirable. To deter- 
mine whether this could be achieved by improved gain and delay equali- 
zation, the development of an experimental adjustable equalizer was 
undertaken. The considerations outlined show that such an equalizer 
should approximate the desired characteristics with independent gain 
and delay terms of the harmonically related cosine type. Equalization 
to reduce cross modulation in telephone channels and differential phase 
in color television must be performed before demodulation of the FM 
signal to base band. The equalizer was, therefore, built to operate in 
the 60- to 80-me IF band. 


IV. TRANSVERSAL EQUALIZER 


One method of obtaining independent control of the loss and delay 
characteristics of a network has been achieved in the transversal filter.’ 
Equalizers have been designed on this principle for the ‘equalization of 
television circuits.” “ This type of equalizer, referred to here as a trans- 
versal equalizer, provides a flexible means of synthesizing any loss char- 
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acteristic and any delay characteristic limited only by the number of 
harmonics that are provided and the range of each. 

Basically, the transversal equalizer consists of a delay line with 
equally spaced taps, with a means for independently controlling the 
amount of signal fed through each of the taps to a summing circuit, 
as shown schematically on Fig. 3. The input signal is fed into one end 
of the delay line which is terminated at the other end. The center tap 
is fed to the output and forms the main transmission path. 

The operation of the equalizer can best be described using the ‘‘time 
domain”’ analysis based on the theory of paired echos.® Portions of the 
signal tapped off the “leading”’ or first half of the delay line will not be 
delayed as much as the main signal and will introduce leading ‘‘echos’’. 
Similarly, lagging echos can be obtained from the taps on the lagging 
or second half of the delay line. Combinations of both types of echos, 
either positive or negative as required, can be added to cancel out, to a 
first approximation, distortion present in the input signal. 

This analysis can also be carried out in the frequency domain. To 
obtain a family of cosine loss versus frequency characteristics without 
any appreciable delay characteristic, equal leading and lagging echos 
of the same polarity are added to the main signal in the summing circuit. 
To obtain a corresponding family of cosine delay versus frequency 
characteristics without loss distortion, leading and lagging echos equal 
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Fig. 3 — Block schematic of transversal equalizer. 
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in magnitude but of opposite polarity are added in the summing circuit 
to the main signal. 

To achieve a practical equalizer for operation over the 60- to 80-me 
band requires the following components: delay line or delay networks, 
means for tapping off small portions of the signal controlled both in 
amount and polarity, and a suitable summing circuit. 


V. DERIVATION OF THE EQUALIZER CIRCUIT 


A brief analysis of the operation of the equalizer will be given at this 
point as a basis for discussion of the method of tapping the signal and 
controlling the amplitude and polarity of the tapped portion. 

ig. 3 shows the basic delay line PQ as well as the means used for 
producing the main signal and a single pair of leading and lagging echos. 
The tap labeled ‘‘o” in the center of the line produces the main signal. 
The tap ‘‘a’’, being closer to the input, produces a signal which leads the 
main signal by time r. The tap ‘‘b”’ produces a signal which lags the main 
signal by the same amount. The boxes “Kk,” and ‘‘ K,” control the ampli- 
tude and polarity of the leading and lagging signals which are to be com- 
bined with the main signal to produce one term of the desired equaliza- 
tion characteristic. It will be shown that these three signals will provide 
one cosine gain term and one cosine delay term, both having the same 
period, but being independently controllable as to amplitude and 
polarity. 

We will choose as our reference point for phase the main output sig- 
nal, ¢ = Ee’. The output from tap “a” is then 


Eee. 
After passing through box ‘“‘K,’’, this becomes 
e, = K,Ee*”. 
Similarly, 
es ee K,Ee®"™, 
Here the terms K, and Ky are of the form 
K= +e°* 


where a is the attentuation in nepers of the box K. Note that | K | is 
less than unity, assuming the box represents a passive network. Combin- 
ing these two signals with the main signal, we have 


€. = eo teat e = Be fl + Kae” + Kye*’). (1) 
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Now it will be shown that, by the adjustment of the two parameters 
K, and Ky, it is possible to realize independent control of a cosine gain 
term and a cosine delay term. | 

Since, in general, K, # Ky, let us define 


K, = K,+ 4, 
and 
Ky = K, — K,. (2) 
Then | 
GS hen dle ae) eK en ea (3) 
Substituting the pasonometn form: 
| Cp = Ee*'{1 + 2K, cos wr + 7 2K, sin we]. (4) 


Note that for K, equal to K, , the sine phase term is zero and that for 
K, equal to —K, the cosine gain term is zero. Similarly, by proper pro- 
portioning of K, and K, , K, and K , may be assigned any desired values. 

If we normalize (3) by setting e¢ = He’ = 1, the expression in 
brackets can yield two vector diagrams which are useful in explaining 
the functioning of the equalizer. To obtain the diagram shown in Fig. 
A4(a), we have set K, = 0. We then have a unit vector, representing the 
main signal, a leading echo K,e””, and a lagging echo K,e*””. The 





(b) 


Fig. 4— Vector diagrams of paired echos. (a) Equal echos of same polarity 
produce magnitude change without phase change. (b) Equal echos of opposite 
polarity produce a change in phase shift with a minor change in magnitude. 
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vector representing the leading echo rotates clockwise with respect to 
the main signal when the frequency increases, whereas the vector repre- 
senting the lagging echo rotates counterclockwise by the same amount, 
and the resultant thus varies in magnitude but not in phase. The magni- 
tude of the resultant 1s given, for this case, by the first two terms in 
parentheses in (4). 

If, on the other hand, if we set K, = 0, we have the three vectors 
shown in I’ig. 4(b), identical with those in Tig. 4(a) except that the 
polarity of the lagging echo has been reversed. In this case, the two 
echos produce a resultant, e,, which is in quadrature with the main 
signal. lor small echos, e, is thus shifted in phase from the main signal, 
with substantially no change in magnitude. The resultant in this case 1s 
given by the first and third terms in parentheses in (4). This gives a 
sinusoidal variation in the phase of the resultant. Since envelope delay 
is defined as d@/dw, where 8 is the phase shift through the circuit in 
question, the sinusoidal phase ripple will be seen to yield, after differenti- 
ation, a cosine delay ripple. 

The period of the ripple can be seen from the above expressions to 
depend on 7, the delay between the leading and the main tap, and be- 
tween the main tap and the lagging tap. Other pairs of echos, each pair 
symmetrically disposed about the main tap, but with different values 
for 7, will give transmission ripples of different periods. To provide a 
series of orthogonal terms, the values of 7 must be integral multiples of a 
common value, normally that required to produce 180° phase shift 
across the band of interest. 

A complete equalizer must, of course, sum up the various echos and. 
the main signal, taking care that.the delay between the tap and the 
summing point is the same for each echo and the main signal, that 
parasitic losses such as losses in cabling are the same for each path 
through the equalizer, and that any frequency characteristic in the tap- 
ping device or other parts of the equalizer is properly equalized out so 
that the over-all equalizer introduces a minimum of distortion of its 
own. 


VI. DIRECTIONAL COUPLER 


To reduce incidental distortion, it is desirable that the device used to 
tap the delay line for the main signal and the echos introduce substan- 
tially no discontinuity in the main line. The device chosen for this pur- 
pose is a directional coupler. It is shown symbolically in Fig. 5. The di- 
rectional coupler is a four port device having properties similar to a 
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hybrid coil. Power entering one port divides (not necessarily equally) 
between two other ports, but none of it reaches the fourth, or conjugate 
port. In Fig. 5, the power entering at 1 divides between 2 and 4, that 
entering at 2 divides between 1 and 3, that entering at 3 divides between 
2 and 4, and that entering at 4 divides between 1 and 3. Directional 
couplers inherently provide an impedance match at all four ports. Thus, 
such a coupler sets up no reflections in the main line. Its insertion loss 
in this line may be kept small by having nearly all the power enter- 
ing at 1 come out at 2; then only a small fraction is diverted to 4. Co- 
axial directional couplers have been discussed in the literature” ”* and 
will not be dealt with in detail here. 


DIRECTIONAL 
COUPLER 





——> 
Eé 


| aeei@ees 


Fig. 5 — Diagram of directional coupler. Input signal divides between Ports 
2 and 4 with no output at Port 3. Termination Z at Port 4 reflects some signal to 
Port 8, proportional to the reflection coefficient, p. 


The coupler used here (J68333C) is one originally developed to 
measure reflections on II" transmission lines in the TD-2 system. The 
directivity of a coupler is defined as the coupling loss between main line 
and branch line in the undesired direction less the loss in the desired 
direction (loss from 1 to 3 less the loss from 1 to 4, for example). In the 
J68333C coupler, the directivity can be adjusted to exceed 45 db over 
the band of interest. This can be done by adjusting two screws, shown on 
model in Fig. 6, to obtain the optimum spacing between the coupling 
elements. The loss between the main line and the branch line in the 
desired direction is about 23 db at mid-band (70 mc), and decreases 6 
db per octave with increasing frequency. The loss along one of the coupled 
lines (1 to 2 or 3 to 4) is very small. 

Use has been made of the directional properties of the coupler in pro- 
viding a simple means of controlling the amplitude and polarity of the 
tapped signal. Referring to Fig. 5, and keeping in mind the properties 
of the coupler, it will be noted that a small portion of the input signal 
appears at Port 4 of the coupler, but none at Port 3. If the impedance Z 
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Fie. 6 — J-68333C directional coupler with cover plate removed to show cou- 
pling elements. Optimum spacing for maximum Seve can be obtained by ad- 
justing screws. 


matches the impedance seen looking into Port 4, all of the small signal 
will be absorbed in Z. If Z is an adjustable resistance, then a controllable 
portion of the small signal can be reflected back into Port 4, whence most 
of it will come out Port 3. A small portion of the reflected signal will 
emerge at Port 1, headed toward the input. This portion will be attenu- 
ated by twice the coupling loss between the main and the branch line 
plus the return loss of the reflection at Z, and can be made negligible. 
The interactions caused by the reflected signal on the main delay line 
entering previous couplers are also reduced by twice the coupler loss. 
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The coupler in Fig. 5 represents any one of the taps on the line PQ 
in Fig. 3. The signal emerging from Port 4 can be written as He", 
where 6 is a time delay dependent on the location of the tap on the line 
PQ. The signal emerging from Port 3 is then pe’, where p is the 
voltage reflection coefficient at Z, and is given by 


_Rk— fo 

~ R+ Ro | 
Here F& is the value of resistance used to provide the impedance Z, and 
Ro is the impedance seen looking into Port 4. An examination of the 
signal from Port 3 shows that it is the same as the signals e, or e in Fig. 
3, with the reflection coefficient p substituted for variable K, or Ky in 
Fig. 3. Thus, it is seen that we may use the reflection at Z, variable by 
controlling the value of R, to perform the function of the ioe K in Fig. 
3. Neglecting parasitic ieee we may then write: 





_. _ B— Ro 
cela aa ems 
and 
_ l1+k 


This gives us the value of R to use for any desired value of K for any of 
the taps which derive echos, assuming the summing circuit has equal 
attenuation in all paths. In the case of the main central tap, the signal 
from Port 4 of the coupler is seen to be the same as the main signal ép in 
Fig. 3, and is used as such directly. 


VIT. METHOD OF ADJUSTMENT 


The detailed design of a manually adjustable equalizer is materially 
influenced by the method to be used in the field for determining the 
setting of its controls. The present equalizer with 14 independent con- 
trols would present a complex problem of field adjustment unless special 
procedures were developed to simplify the adjustment. To adjust the 
equalizer, the radio circuit being equalized must be taken out of com- 
mercial service, so any reasonable measures to simplify the adjustment 
or reduce the time required are justified. 

Two methods appeared to be feasible at the time the development 
was started. One would be to use existing gain and delay sweep test 
circuits. These present a visual display of the circuit gain or delay dis- 
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tortion versus frequency. These displays are not available simultane- 
ously with present equipment. To adjust the equalizer controls using 
this equipment, it must be possible to adjust either gain or delay without 
affecting the other. Thus, all the gain terms can be adjusted in suc- 
cession using the gain display. Then the procedure is repeated with the 
delay display, adjusting the delay terms. Since a combination of leading 
and lagging echos in equal amounts is required for this procedure, an 
arrangement of the controls to facilitate this is required. One way to 
achieve this is to use stepped rheostats with the steps proportioned to 
introduce equal amplitude changes in the echo voltage. With this ar- 
rangement, gain changes can be introduced by rotating the two switches 
corresponding to a pair of echos in the same direction an equal number 
of steps. Delay changes can be obtained by similar rotation in opposite 
directions. A further refinement consisting of mechanically ganging the 
controls is possible but this was not done on these experimental models. 

The second method of adjustment would be to develop a special test 
set similar to the one developed for the L3 system.® This could produce 
a meter reading proportional to the amount of gain and delay distortion 
present in the circuit. Successive controls could then be adjusted for min- 
imum meter readings. Experience with the L3 system cosine equalizers 
has shown the desirability of continuously adjustable controls for such 
a method. 

To test both methods under field-trial conditions, two versions of the 
equalizer were built — one with stepped rheostats and one with con- 
tinuously adjustable rheostats. 


VIII, COAXIAL RHEOSTAT 


Since there were no available continuously adjustable rheostats sat- 
isfactory for operation at 70 mc, a special rheostat was developed for 





_ Fig. 7— Schematic of coaxial rheostat. Moveable sleeve changes position of 
inner contacts touching ceramic rod, changing resistance. Fixed outer contacts 
maintain constant path length to frame. 


EXPERIMENTAL TRANSVERSAL EQUALIZER FOR TD-2 1441 





Fig. 8 — Model of coaxial rheostat with cover removed. 


this purpose. It employs a ceramic rod, + inch in diameter, coated with 
a pyrolytic carbon film, asa centermember of a coaxial structure. A metal 
sleeve which is moved longitudinally by a lead screw carries sliding con- 
tacts along the rod. These parts are supported inside a rectangular hous- 
ing which forms the outer conductor. A second set of fixed contacts at- 
tached to the rectangular housing makes contact with the sleeve. This 
arrangement maintains a substantially constant length of path from 
the input end of the rod to the housing, which forms the ground, inde- 
pendent of the position of the sleeve. The schematic of the rheostat is 
shown in Fig. 7. A model of the rheostat is shown in Fig. 8. 

To obtain uniform adjustment of amplitude in decibels, a resistance 
that varies exponentially with length or with rotation of the lead screw 
is required. Such a resistance characteristic is realized by varying the 
thickness of the carbon film along the rod. This produced a total re- 
sistance which varied from 20 ohms at the low setting to about 350 
ohms at the high resistance setting. After an initial wearing-in period 
of 1,000 cycles of moving the contacts over their full travel, the resist- 
ance was changed less than 1 per cent by another 9,000 cycles. This. 
amount of wear is estimated to be greater than that encountered in 
twenty years of normal operation. 

The housing and rod were dimensioned to form a 75-ohm transmission 
line. Measurements of the impedance at the input connector, made at 
frequencies from 60 to 80 mc, showed that this impedance can be approx- 
imated by a resistor terminating 6.7 cm of 75-ohm coaxial cable. For 
the 75-ohm setting, the reflection coefficient of the rheostat is less than 
2 per cent across this frequency band. | 
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One model of the equalizer was completely equipped with these rheo- 
stats. By allowing for the equivalent length of cable within the rheostat, 
an essentially pure resistive termination was obtained. 


IX. OTHER COMPONENTS 


Other components required for the equalizer included stepped rheo- 
stats, delay line or delay networks, a suitable summing network, and a 
loss equalizer. | 

The stepped-switch rheostats were made from standard switch parts 
with eleven positions. Deposited carbon resistors, 205D, were used for 
the steps. The mid-position corresponded to the circuit impedance level, 
75 ohms. The other steps were arranged to provide equal increments of 
echo amplitude measured in decibels. With careful control of lead lengths, 
special shielding and a coaxial cable connector, satisfactory control of 
the return loss of this rheostat was obtained. 

Resistance pads were added to the switch assemblies associated with 
each of the echo terms. The loss of each pad was determined so that the 
corresponding term would have the desired maximum amplitude. In 
addition, the losses of the pads associated with the leading echo terms 
were increased to compensate for the midband loss of the delay line be- 
tween the leading coupler and the corresponding lagging coupler. This 
insured that the two echos would have equal amplitudes. 

The delay required between taps in the delay line is 0.025 microsecond, 
corresponding to a change in phase shift of 180° from 60 to 80 mc. In 
order for the equalizer cosine characteristics to have maxima at the band 
edges, the total phase shift at 60 and 80 mc must be successive integral 
multiples of 180°. Since the phase shift of coaxial patch cable is closely 
linear and proportional to length, it could be used for the delay line. 
Lumped-element delay networks consisting of two or more all-pass sec- 
tions are a feasible alternative and would reduce the over-all size and 
weight. In view of the additional development effort involved to produce 
these and the experimental nature of this equalizer, it was decided to 
use coaxial patch cord. The type selected, 728A cable, has a polyethylene 
dielectric and is tested during production for return loss in the 50- to 95- 
mec band. The length required for each section is about 15.6 feet. This 
much cable has a loss of about 0.3 db at 70 mc. 

It was originally proposed to use a series of directional couplers for 
summing the echo voltages with the main signal. This would provide 
additional isolation between terms. However, tests on a preliminary 
model indicated this isolation was not required in this application. In- 
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Fig. 9. — Summing network with cover removed. Main signal input is at right, 
echo signal inputs on top, and output is at left.. 


stead, a resistance summing network was developed using deposited 
carbon resistors. An L-pad is used in each echo path and a series resistor 
is added to the main path to preserve the 75-ohm impedance level, intro- 
ducing a main path loss of about 0.4 db per tap. A model with cover 
removed is shown on Fig. 9. The main signal is introduced at one end 
of the structure, the echo voltages are connected along the side and the 
sum is taken off the other end. The return loss measured at any of the 
connectors with the others terminated was of the order of 40 db over the 
60- to 80-me band. 

An attentuation equalizer is required to make the transmission through 
the main path constant. This path consists of about 108 feet of 728 cable, 
the straight-through loss of six couplers, and the coupling loss of the 
main coupler. The net distortion over the band is a slope of about 1.5 
db and is corrected for by a constant resistance equalizer. Return losses 
exceeding 34 db were obtained over the frequency band. 


xX. ASSEMBLY 


All the components were mounted on the rear of a standard relay rack 
panel. The rheostat controls are arranged on the front of the panel as 
shown on Fig. 10. This is a front view of the completed equalizer. ‘The 
controls for the leading echo terms are on the left and for the lagging 
echo terms on the right. They are arranged vertically in numerical order 
with the first terms (shortest time separation from the main signal) at 
the top. 

The rear of the panel is shown on Fig. 11. The directional couplers 
are mounted horizontally in two vertical columns. The cables forming 
the delay line sections are terminated in a plug and a jack and these are 
inserted in successive couplers, from the second port of one to the first 
port of the next. The third port of each coupler is connected through a 
short cable to its corresponding rheostat assembly. The fourth port of 
each coupler is connected to the summing network. An exception is 
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Fig. 10 — Front view of equalizer, showing rheostat controls. Leading echo 
controls are on left, lagging ones on right. First harmonic controls are at top. 
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the middle coupler, the fourth port of which is terminated in 75 ohms 
and the third port connected to the summing network. 

The envelope delay in the cables connecting each coupler to the rheo- 
stat and to the summing network appears as delay for the particular 
echo path. Since these delays are not negligible compared to the 25 milli- 
microsecond delay between echos, the cable lengths were controlled so 
that the same amount of additional delay was introduced into each path 
including the main path. 


XI. ADJUSTMENT AND PERFORMANCE 


After the equalizer was assembled, the length of each of the cables 
connecting the couplers to the summing network was adjusted so that 





Fig. 11 — Equalizer with cover removed. Cables wound outside frame are the 
delay line sections. Summing network and directional couplers are in center. Rheo- 
stat cases are mounted on panel with coaxial connection in rear. 
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the zeros of the cosine shape occurred at the proper frequencies, as ob- 
served when the associated rheostat was set at maximum and minimum 
positions, with all other rheostats set at midrange, or no-echo, setting. 

Some reflections were present in the main signal path as evidenced by 
ripples in the gain characteristic when all rheostats were set at midrange, 
corresponding to the ‘‘flat”’ loss condition. These reflections were reduced 
to some extent by minor readjustments of the balancing screws on the 
directional couplers. The over-all flat gain characteristic obtained after 
these adjustments is shown on Fig. 12. 

This figure also shows the seven gain characteristics obtained when 
each pair of rheostats is set for maximum gain. The markers on the refer- 
ence trace correspond to the band edges. A sharp gain bump resulting 
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Fig. 12 — Measured gain characteristics of equalizer. 
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Fig. 13 — Gain change introduced by changing delay terms from zero to maxi- 
mum. Left, normal case. Middle, third harmonic term at maximum. Right, fifth 
harmonic term at maximum. 
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from reflections on the delay line occurs just above 80 mc and distorts 
each characteristic near thisfrequency. The delay characteristics obtained 
closely resemble the corresponding gain characteristics. The delay char- 
acteristic with all rheostats on midrange was flat to within about +1.5 
millimicroseconds. 

Another measure of the performance is the amount of interaction 
between gain and delay characteristics. As shown on Fig. 13 the gain 
changes less than 0.2 db when the third or fifth harmonic delay terms 
are set at their maximum values of 11 and 12 millimicroseconds, respec- 
tively. Similarly, when a pair of rheostats are set for the maximum gain 
characteristic, the effect on delay is of the order of two millimicroseconds. 
These results, which are typical, indicate the interaction effect is of the 
order of 20 per cent using one neper of gain distortion ripple as equiva- 
lent to a ripple of one radian of phase shift amplitude. The effect of this 
interaction on the field use of the equalizer is to require a second round 
of equalization to correct for interactions after the gross distortions in 
a circuit have been equalized. 


XII. FIELD EXPERIMENTS 


Models of this equalizer were installed in two channels of the TD-2 
system between Denver, Colorado, and Omaha, Nebraska, early in 
1956. This route is about 500 miles long and includes 18 microwave links. 
One equalizer was installed at the center of the route and a second one 
at the receiving end. The results of a typical set of characteristics ob- 
tained are shown in Figs. 14 and 15. The first shows the delay charac- 
teristic of the whole channel measured at the receiving intermediate 
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Fig. 14— Measurements on Denver-Omaha route. Envelope delay distortion 
sree at intermediate frequency point. Left, unequalized; right with equalizer 
adjusted. 
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Fig. 15 — Measurement on Denver-Omaha route. Transmission characteristic 
measured at video (a) unequalized, (b) with equalizers adjusted. 


frequency point. The “‘unequalized”’ characteristic 1s the circuit delay 
distortion immediately after standard line-up procedures without video 
equalization. The “equalized” characteristic shows the same circuit 
distortion corrected by the addition of the transversal equalizer. The 
first two delay terms of this equalizer were supplemented by the use of 
319 type (linear slope) and 315A (parabolic) equalizers. The second loss 
term was supplemented by the use of experimental fixed parabolic loss 
equalizers. An attempt was made to obtain the best performance over 
the center 10 me of the band, corresponding to the first order modulation 
band. | 

The gain distortion was adjusted on a demodulated video basis as 
this was the only type of sweep gain circuit available. As shown in Fig. 
15, the unequalized circuit had more than 0.5 db per mc of slope up to 
10 me. The addition of the equalizer produced a flat band to about 8.5 mc. 

The effect of this equalization on cross modulation, measured by simu- 
lating the message load with a flat band of noise, is shown in Fig. 16, 
for two sample channels. In these curves, normal drive represents the 
noise load whose power is 12 db below the power in a sine wave giving 
4-me peak deviation of the TD-2 carrier. Channel A showed 5-db im- 
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Fig. 16 — Effect of improved equalization on cross modulation noise, Denver- 
Omaha route. Measurements on two channels, referred to — 9 db transmission 
level. 


provement at normal drive, and a somewhat greater improvement at 
higher drives. Channel B, however, showed a slight degradation at 
normal drive, which becomes a 9- or 10-db improvement at high drive. 
In the case of the latter channel, the improvement at normal drive was 
limited by the presence of a delay ripple, due to waveguide echoes, 
which was of too short a period to be equalized by the present equalizer. 

The realization of such improvements in a working system is limited 
by such waveguide echoes, which are not stable enough for ready equal- 
ization, as well as by other instabilities in the transmission characteristic 
which have been attributed to antenna and air path effects. 
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Transmission Aspects of Data Transmission 
Service Using Private Line Voice 


Telephone Channels 


By P. MERTZ and D. MITCHELL 
(Manuscript received February 2, 1957) 


An exploration is reported of the possibilities of a moderately fast data 
transmission system to use private line message facilities. A comparatively 
conventional system was desired to permit expeditious application. An 
auxiliary “word start” signal was necessary for the system considered. 

Transmission characteristics of a number of arrangements were examined. 
These included several exploratory AM vestigial sideband systems (using a 
spectrum similar to telephotography), double sideband AM systems, various 
telegraph and other multichannel systems. 

It was concluded that the 1600 bits per second usable over the AM vestigial 
sideband arrangement was about as fast as could be expected with the data 
system contemplated. This transmission 1s not as “rugged,’’ with respect to 
impulse noise and sudden level changes, as other slower arrangements, but 
it 1s expected to be satisfactory. It will require delay correction, and simple 
methods are considered for carrying this out. 


I. INTRODUCTION 


The Bell System has been approached on a number of occasions in 
regard to the transmission of computing machine and similar data over 
its telephone circuits. This has reached the point where specific possibil- 
ities for private line data transmission have been given serious consider- 
ation. | 

The telephone network was developed for speech transmission, and 
its characteristics were designed to fit that objective. Hence, it is recog- 
nized that the use of it for a distinctly different purpose, such as data 
transmission, may impose compromises both in the medium and in the 
special service contemplated. 

A short time ago the authors were assigned the problem of examining 
possibilities for such an adaptation aimed at high speed, and exploring 
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the nature of the transmission compromises that might be needed. As 
a result, a variety of data transmission systems in different stages of 
development have been investigated including some telegraph systems. 
Certain conclusions have been reached regarding their suitability for 
use over private line telephone facilities. (By ‘private line’’ facilities 
are meant facilities leased to the subscriber on a more or Jess permanent ° 
private basis, and that are not set up by operators at a telephone central 
office switchboard.) 

These conclusions are summarized below. In the later text there is 
included the background material for the conclusions which includes a 
brief characterization of the facilities. Finally there is included a dis- 
cussion of the line treatment which may be needed for the best trans- 
mission. | 

It should be emphasized that not all possible data systems for use 
over telephone circuits are covered. The problem considered covers 
particularly some recently proposed applications, for which the need of 
a relatively high bit rate is important. Also, only the more promising 
comparatively conventional systems, which have been relatively well 
tested and can be readily applied, are considered. More radical designs 
are conceivable but they would require more extensive investigation 
before conclusions could be reached concerning them. It is clear also 
that the designs involved in the choice of a system are determined by 
the type of service it is to provide. 


1.1 Conclusions — 


It is concluded that about the fastest transmission of data which can 
be accomplished with the present art over message-type telephone 
facilities is obtainable with an amplitude modulated vestigial sideband 
system. Such a system will be presented in some greater detail below. 
Its frequency spectrum is similar to that of a telephotograph signal? and 
a number of the transmission problems involved are the same. 

This system will provide about 1600 binary digits of data (or “bits’’) 
per second, but it requires some special selection and considerable treat- 
ment of many types of circuits. This treatment is necessary to reduce 
noise, particularly of the impulsive type, and to correct for delay dis- 
tortion. 

Such a system is therefore considered suitable where a high bit rate 
is essential. It will be developed in the text that beyond the matter of 
the delay correction and other treatment of circuits required, the vestigial 
sideband process imposes a certain signal-to-noise penalty. A further 
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signal-to-noise penalty comes from the method of multiplexing an aux- 
iliary channel needed for word start indications. 

Where a lower bit rate is acceptable, a 750 bit per second amplitude- 
modulated, double-sideband system which is now available, and has 
been tested extensively, is somewhat more rugged. It permits operation 
over the great majority of telephone facilities and will also be described 
in more detail. 

The most rugged system considered uses frequency-shift transmission. 
This form of transmission has been used extensively for some time on a 
multiple-channel, voice-frequency telegraph basis in which each channel 
is capable of 46 to 74 bits per second. When used in this form to handle 
high speed data signals, it requires relatively complicated terminal 
devices because the several channels have to be merged to provide one 
high speed data system. However, when frequency shift is used as a 
single channel over an untreated telephone message facility the system 
promises to be relatively simple and to give a total possibility up to 
1,200 bits per second according to the type of facility. 


1.2 Summary Table 


These findings are concisely grouped in a summary table, Table I. 
These entries are based upon present knowledge and are believed to be 
reasonably accurate, although estimates regarding impulse noise need 
more extensive checking, particularly in the case of the broad-band, 
frequency-shift system. 

The table compares relative estimated performances of three broad- 
band systems among themselves, and with a subdivided channel (or 
telegraph) system. The performances considered cover the effects of 
noise and delay distortion, and the bit rates considered achievable in a 
1,000-cycle band and in a 300- to 2,800-cycle telephone band. Some 
crude approximations covering speech are also given as a matter of 
interest. 

The noise performance is given in terms of relative total power capac- 
ity required in the line for a given error rate in the presence of a given 
noise. The double sideband system is taken as reference. Allowance is 
made in the multiple channel or telegraph system for occasional peaking 
caused by temporary unfavorable phasing. In this part of the comparison 
a 12 channel system has been assumed. This 1s about as many channels 
as can be used on a telephone facility that has heavy impulsive noise. 

The delay distortion figures represent some present ideas on good 
engineering design in the allowable impairment of signal-to-noise ratio. 
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TABLE I — SUMMARY OF VARIOUS DATA SYSTEM CHARACTERISTICS 


Approx. Max. 
Relative Peak Power Re- |Delay Distortion 
quired in Line for Equal | in Millisec. Al- 
Performance* in Presence] lowable in Band 


of Noise, db (for 3 alan Boron rs ges SRE 
: penalty _jp {per Second in Tele- 
Use in Band ee pene fe db phone Facility 300- 


Bue: Noise for DSB) 2,800 Cycles 
Within 1000 


Ran- 
SF Impulse|_O 
dom pope [Channel], S¥¢!e 
Broadband VSB; +6 | +6 | +6 {+0.4 ;+0.4 1000 | 1600** 
Broadband DSB |_ 0 0 QO |+0.55)+0.55 700 800 to 1400** 
Broadband FS —3 —3 —7{ |%0.5 |40.5 650 750 to 1200** 
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Telegraph Comparison 














43A1 Channel +1034| Or! — 1044/45 60 450 1100 
( 1 hannels - (6 Channels) | (15 Channels) 
Speech Comparison 
Speech +10 +10 +10 [410 | | | 50 


* High grade performance; i.e., less than 1 error per 100,000 bits. 

** High figure assumes accurate delay correction and control of nonlinearity. 
+ Depends on precise line-up of filter and carrier. 

tt Allowance for peak factor of 12 frequencies. 

VSB = Vestigial sideband; DSB = Double sideband; FS = frequency shift. 


This impairment is here assumed to be about 3 db. A rather larger 
impairment has on occasion been assumed by other authors.’ In the 
case of the telegraph system some arrangements for merging the signals 
in parallel channels into a single high speed channel require a certain 
exactness in timing correlation. The need for this may be overcome by 
the use of a small amount of data storage in the receiver of each channel. 
The delay distortion figures quoted in the table assume no need for this 
timing exactness. As in the other part of the comparison twelve chan- 
nels are assumed. | 

The last two columns indicate the bit rate that can be expected of the 
various systems, first per 1,000 cycles of band, and second for a tele- 
phone facility of somewhat narrow (but frequently encountered) band- 
width. Some of these figures assume a careful control of delay distortion 
and of nonlinear distortion. In the 1,000-cycle band only six channels 
of the telegraph assumed may be accommodated. In a 2,500-cycle band 
the number can be extended to 15. The band that can actually be used 
for telegraph, over a given facility, depends upon the nature of that 
facility. 


PRIVATE LINE DATA TRANSMISSION 1455 


Some comparison data are indicated for telephone speech. The bit 
rate given assumes particularly message communication and not finer 
shades of artistic expression. For this a collection of phonemes (or 
elementary sounds) in the fifties, needing six bits for identification, and 
a typical speech rate of eight phonemes per second, require 48 bits per 
second. A few bits are also needed for pitch indication, and the figure 
is rounded to 50. 

The low bit rate obtained for telephone speech communication indi- 
cates considerable redundancy in the speech signal sent over the tele- 
phone channel. This suggests why substantial transmission impairments 
can be tolerated without destroying the intelligibility of speech, as 
compared with telegraph or data signals. 


1.3 Nature of Study 


The procedure followed has consisted of examining systems of binary 
or similar signal transmission which appeared suitable for the sending 
of data. The systems studied are listed here, and some description of 
them is given later in the text. As noted, part of the information has 
come from outside the Bell System. 

1. Exploratory 1,650 bit per second vestigial sideband system studied 
at Bell Telephone Laboratories. 

2. Exploratory 1,600 bit per second vestigial sideband system studied 
at Lincoln Laboratory of M.1.T.! 

3. Exploratory 750 bit per second double sideband system, of general 
type reported by Horton and Vaughan.’ | 

4, Voice frequency telegraph channels.° 

5. “Polytonic” signaling system reported by Lovell, McGuigan and 
Murphy.’ 

The transmission problem of applying these various systems to the 
various types of telephone message facilities employed for private line 
service has been considered. These are also listed, and a brief character- 
ization of them is given in the text. 

1. Voice frequency circuits, over cable and open wire. 

2. Type-C carrier circuits for open wire.!® 

3. Type-N carrier circuits for cable.® 

4. Broad-band carrier systems! using A channel banks for paired 
cable, coaxial cable, open wire, and microwave radio. 

5. Other broad-band carrier systems. 
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1.4 Outline of Paper 


As a result of the study, some recommendations were made. The dis- 
cussion of this material is presented in Section II. It covers first the sort 
of service, with comparatively high bit rate, for which there has been 
user demand, and which can be furnished over private line facilities. 
Secondly, the recommendations cover the broad features of the signal 
characteristics which appear promising for use over such facilities. These 
recommendations are at the present time evolving into an exploratory 
system. The recommendations themselves, together with some general 
remarks, are presented in Sections 2.0 and 2.1. The background material 
on which these were based, and which covers consideration of the five 
systems which have been listed, is presented in Sections 2.2 to 2.5. 

The discussion on the nature of the problems involved in transmission 
of the signals over the telephone plant to be used (which above has 
been listed in five categories) is finally covered in Section III. 


II. SYSTEMS OF BINARY SIGNAL TRANSMISSION 


2.0 General Remarks 


There are a number of arrangements in the present art, both experi- 
mental and commercial, which are capable of sending digital data infor- 
mation under something like the conditions required for the service 
considered. Study of these has led to a set of recommendations which 
are outlined below. The specific arrangements are then discussed in 
more detail. The description covers only the essential transmission 
characteristics of the arrangments. 

The arrangments generally divide into two groups, those which are 
essentially short-pulse single channel (though some of these may include | 
a slow auxiliary channel), and those which use frequency division multi- 
plex channels and therefore employ longer individual pulses. One impor- 
tant advantage of the multiple channel systems is noted in Section 3.3 
as consisting of an increased immunity against impulse noise. 

The single channel group shows much similarity among the systems. 
The principal difference is that of bit rate. The faster systems use ves- 
tigial sideband transmission, at the expense of a certain increase in 
vulnerability to noise as compared with those which use double side- 
band transmission, and also at the expense of a more general need for 
delay distortion correction because of the speed. 

The systems in this group which use vestigial sideband are expected 
to be able to operate, with a very low error rate (some 1 error per 100,000 
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bits), over telephotograph type circuits, which employ delay distortion 
correction. This is a ‘‘normal”’ condition error rate, and does not in- 
clude abnormal circuit conditions, such as static, trouble conditions, 
etc. It also assumes a more or less even error distribution. 

One of these systems gives very desirable word start indications by 
using an additional level of signal. ‘‘Words” are groups of signal elements 
of fixed total number each. A distinctive separation signal greatly 
simplifies their recognition at the receiver, particularly after short line 
interruptions. 

As the price, however, of both the speed and the third level signal 
indication, this system is more vulnerable than the others to impulse 
noise, of the kind encountered in hitherto installed N1 carrier and open- 
wire circuits, which have not been treated for impulse noise. It is also 
more vulnerable than the other systems to sudden level changes in the 
received signal. 

The multiple channel group is generally characterized by a lower bit 
rate. Of all of them only one, the frequency-shift carrier telegraph, 
shows capability of consistent operation over untreated N1 carrier. On 
this type of carrier the frequency-shift telegraph permits a bit rate of 
the order of only half that obtainable with the vestigial-sideband sys- 
tems. This telegraph system performs much better over other types of 
telephone facilities but even then its bit rate is only about three fourths 
that desired. 


2.1 Digital Data Transmission Service 


2.1.1 Some Desirable Major Requirements 

1. Transmission of a maximum of 1,600 bits per caecond with an 
error rate not to exceed one in every 100,000 bits (or once per minute). 

2. Applicability to most telephone facilities for private line use. 
Some selection of circuits and some treatment of those selected will be 
acceptable. The circuits carrying data are to be one-way terminal 
circuits only (1.e., not to be linked in tandem). It is expected that for 
the bulk of the service the circuits would not be likely to run over some 
200 to 300 miles in length, but a small number of 3,000-mile circuits 
is considered possible. 

2.1.2 Characteristics of the System 

The system which has been considered and recommended as promising 
for the service outlined above has the following essential characteristics: 

1. Carrier at 2,000 cycles, with vestigial band extending up to some 
2,400 cycles. Lower nominal effective band (half the bit rate in width) 
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extends down to 1,200 cycles, and roll-off band down to about 1,000 
cycles. The frequency space above the vestigial band and below the 
roll-off band is essentially ‘““dead”’ space, free of signal energy. 

2. The signal comprises three components: 

(a) synchronization, (or start), to indicate word separations, 

(b) data, or the actual information, and 

(c) taming, to mark out the successive bit intervals in the data. 

The signal is represented in Fig. 1 as the envelope of a carrier. The 
synchronizing signal has an amplitude of one and a duration of one 
signal element. Data spacing signals have an amplitude of about 0.63 
and a duration each of one signal element. Data marking signals have 
an amplitude of about 0.25 and also one signal element duration. It 
will be noted that this is an “‘upset’’ signal, in which spaces have more 
power than marks. The two signal elements immediately before, and 
again the two signal elements immediately after, the synchronization 
signal are always to be spacing. A timing signal is not actually sent, 
but instead the synchronizing signal is used to pull the phase of a highly 
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Fig. 1 — Multiple level signal. 
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accurate oscillator or ‘“‘clock” at the receiver into the proper phase 
relation at the beginning of each word. 

The phase of the oscillator is readjusted slightly as necessary at the 
beginning of each word but this adjustment is purposely made sluggish 
so that an occasional noise burst will not throw timing out badly. Thus 
the timing information is, in effect, by its repetitive characteristic, highly 
redundant. The receiving device is capable of getting in step without 
any manual adjustment but it may require as many as 10 words (or 
that number of synchronizing pulses) before it gets into exact phase, on 
an initial connection or after losing synchronization. 

2.1.3 Transmission Considerations 

Such a signal leads to the following transmission considerations: 

1. The signal is rather similar in its spectrum to a telephotograph 
signal, and as a first order approximation requires about the same type 
of facility for its transmission. 

2. In particular, the signal is expected to require something like the 
same over-all delay equalization as the telephotograph signal. A brief 
discussion of this point was given in a paper by one of the authors.’ The 
conclusions there are that the telephotograph equalization limits of 
+0.4 times a signal element duration in envelope delay are generally 
reasonable, although these are probably overly severe with respect to 
very fine structure irregularities in the residual envelope delay charac- 
teristic. The formal limits which have been set on the envelope delay 
distortion for the 1,600 bit per second signal are +250 microseconds. 

3. These limits constitute a rather less severe problem for the bulk 
of the expected circuits (200-300 miles) than they present for telephoto- 
graph circuits which are equalized for 3,000- to 5,000-mile lengths. For 
the small expected number of very long circuits the problem would of 
course be the same as for the telephotograph service. 

4. The delay equalization problem requires special consideration in 
view of the nature of practices which have developed in the telephoto- 
graph art. The number of circuits which have been equipped for tele- 
photography has in the past been rather small. The adjustment of the 
delay equalization has involved arithmetical calculation in the process 
of fitting various manufactured delay equalizer sections to the measured 
delay distortion. These methods are not generally suitable for large 
scale operations. It is believed that a process of equalization by prescrip- 
tion will be useable for the 200-300 mile circuits. This 1s discussed in 
more detail in Section 3.3. 

5. So far, it has not been found expedient to transmit telephotograph 
signals generally over unmodified N1 carrier or other compandored 
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circuits, and the same problem is appearing with the proposed data 
signal. The problem is discussed in more detail in subsequent sections. 

6. The specific signal suggested has the disadvantage that multiple 
levels must be discriminated by the receiver. This increases vulnerability 
to noise and level changes, as will now be discussed in some more detail. 

The specific signal which has been suggested, as noted before, is 
outlined in Fig. 1. This signal comprises several levels. In the first place 
it includes a word start indication, on a separate level from the mark 
and space indications. In the second place the lowest amplitude level 
(normally a space, but a mark in the “upset” signal, as has been noted) 
is not made zero, but 0.25 the amplitude of the word start indication 
or “‘synchronizing”’ pulse. The need for this is explained in the discussion 
on vestigial sideband, below. | | 

Consider for the moment a signal of only two levels; these can be 
taken as 1 volt and 0 volts respectively. The discrimination between 
them without error requires that an instantaneous noise pulse at this 
time be kept to less than 4 volt. In these terms this represents an S/N 
ratio of 6 db. 

The use of 0.25 volt minimum signal means that the amplitude range 
between maximum and minimum js reduced from 1 volt to 1 — 0.25 = 
0.75 volts. The maximum allowable noise pulse must be 0.75/2 = 0.375 
volts for this signal. This 1s an 8.8-db S/N ratio, as compared with the 
previous 6 db. It represents a handicap of approximately 3 db which 
must be accepted as part of the price of the increased bit transmission 
rate permitted by the use of vestigial sideband as compared with double 
sideband transmission. | 

An additional penalty comes from the use of three as against two 
signal levels. The spacing signal level is set at 0.625 volt, midway be- 
tween 1 and 0.25 volt. Discrimination between synchronization and 
spacing signals can tolerate noise pulses of (1 — 0.625)/2 = 0.1875 
volt. This amplitude is 15 db below synchronization level. Discrimina- 
tion between spacing and marking signals tolerates maximum noise 
pulses of (0.625 — 0.25)/2 = 0.1875 volt. This is again 15 db below 
synchronization level. Thus the signal tolerates a 15-db S/N ratio 
between synchronization level and the level of maximum noise pulses. 

The difference between the approximate 9-db S/N for the two level 
vestigial sideband signal and the 15 db represents the 6-db handicap 
caused by the multiple level discrimination in the signal. This is the 
price paid for a distinctive word start indication. 

The price also applies to sudden level changes. In a two level signal 
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between 1 and 0 volt, a sudden drop of 6 db (without compensating 
change in the “‘slicing”’ or critical level) causes error. 

In the two level signal between 1 and 0.25 volt, the permissible drop 
is reduced to 4 db (ratio of 0.625/1). 

In the three level signal, the permissible drop is reduced to 1.8 db 
(ratio of (0.625 + 0.1875)/1). 

Automatic gain control and corresponding adjustment of the slicing 
level ameliorate these conditions to some extent, but the problem is still 
a serious one; a sudden rise is also serious. 

The use of a compandor in the transmission facility éxaevenates the 
situation. Some discussion of the action of a compandor to improve the 
signal-to-noise ratio for speech is given in Section 3.1. For the moment 
it can be said that, at the transmitting end, the compandor compresses 
the range of amplitudes in the speech signal at approximately a syllabic 
rate. At the receiving end an expansion restores the original amplitude 
relationships in a complementary manner. 

The compression and expansion are matched almost perfectly as far 
as the human ear is concerned. However data transmission is more vul- 
nerable to short time level irregularities. This allows small imperfections 
in the amplitude restoration to impose a further penalty in the form of 
error hazards. 

2.1.4 Other Considerations 

Present ideas call for the acceptance of the signal by the telephone 
company, and redelivery to the subscriber not in the form indicated 
by Fig. 1, but in the form of the three separate components listed in 
2.1.2. This presents some problems regarding the transmission of such 
signals over the connecting loops between the subscriber and the tele- 
phone office. These are not, however, germane to the general transmis- 
sion problem and will not be considered further here. 


2.2 Bell Telephone Laboratories and Lincoln Laboratory Vestigial Side- 
band Systems 


The first of these represents an unpublished exploration, particularly 
by C. B. H. Feldman and A. C. Norwine, of the possibilities of trans- 
mitting moderately high speed data pulses over telephone facilities. 

The exploratory system used a carrier at 2,200 cycles; a vestigial band 
extended up to 2,600 cycles; the nominal effective band extended down 
to 1,375 cycles; and a roll-off band continued down to 1,100 cycles. In 
order to reduce the quadrature component resulting from. the vestigial 
sideband the spacing signal was made equal to one-third the marking 
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signal amplitude. The receiver used AVC both for amplitude control 
and space-to-mark slicing level adjustment. 

The timing of the receiver sampling instants to determine mark and 
space indications was determined by a flywheel circuit which operated 
from space-to-mark and mark-to-space transitions in the signal. This 
is similar in principle to the old Baudot quadruplex arrangements in 
telegraphy. It was a synchronous system and required a dummy signal 
for a lining-up period previous to the actual transmission of information. 
The lining-up was automatic but required some 15 to 50 milliseconds 
(25 to 80 signal elements). 

A number of experiments with the system were made on actual lines. 
Most of these showed successful transmission, though the error rates 
were not measured quantitatively. The test showed the signal margin 
against error on a cathode ray oscilloscope. The wave trace indicated 
displacements both in the signal amplitude and in the timing of the 
sampling instant. This margin has been found to correlate reasonably 
well (in an inverse relationship) with the calculated delay distortions 
of the circuits used. It also corresponds reasonably well to theoretical 
expectations.’ 

The system reported on by the Lincoln Laboratory of MIT! shows 
much general similarity to the above. Perhaps the most distinctive fea- 
ture of difference is in the use of a word start indicating or synchronizing 
signal, in the form of the high level pulse discussed above and somewhat 
similar to that used in television for scanning-line synchronization. 


2.3 Double Sideband Systems 


A prototype of these systems has been described by Horton and 
Vaughan.’ Several models have been derived from the prototype which 
differ from it, somewhat, in several respects. Large portions of the sys- 
tems are not germane to the present discussion, and only a brief de- 
scription will be given of the signals. 

An outline of the signal spectrum for the most recent of these derived 
models is illustrated in Fig. 2. The main signal is handled on a carrier at 
1,500 cycles. The bit rate is 750 per second, and the nominal effective 
band is shown as +375 cycles. A schematic roll-off is indicated. The 
words in this system are of about 100-signal element length, of which 8 
are used for synchronization. The synchronization pattern involves a 
3-signal element ready pulse on the 600-cycle carrier, simultaneous with 
the first 3 of the 6-signal element marking pulse on the 1,500-cycle 
carrier. Following this comes a spacing bit, then another marking bit 
on the 1,500 cycles only. The next bit is the first information bit. 
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Fig. 2 — Double sideband signal with auxiliary start channel. 


Marking consists of maximum carrier amplitude and spacing is zero 
carrier amplitude. The pulses are shaped both at the transmitting end 
and at the receiver before sampling. 

The earlier prototype system as described by Horton and Vaughan 
was tested over a variety of telephone facilities (not including N1 
carrier) running up to over 12,000 miles in length and found to be quite 
rugged. For reasons which are to be discussed later, the use of these 
systems over compandored circuits presents certain extra transmission 
problems regarding noise and level changes. 


2.4 Voice Frequency (VF) Carrier Telegraph 


The opposite extreme to the use of the telephone facility as a single 
band, to carry short duration pulses, is to subdivide the band into a 
large number of subchannels, each using longer duration pulses. As 
already noted, this has advantages against impulse noise, and also 
delay distortion. 

There is available for this use the VF telegraph system.’ In an AM 
form (40C1) this subdivides the space into 18 telegraph channels, which 
can each carry 100 words per minute (or 74 bits per second). A fre- 
quency shift form (48A1) is available, to give 17 channels, each to carry 
74 bits per second. 

The 18 channels use a band of 200 to 3,200 cycles in the telephone 
facility, and the lowest channel permits only a lower word speed. The 
17 channels occupy the band from 350 to 3,200 cycles. 

It has been mentioned that the telephone channels which provide 
the most serious problem for data transmission are those using compan- 
dors. The untreated N1 carrier is a principal example of such. Further, 
the type of circuits which use compandors are apt to be placed in plant 
which is relatively exposed to impulsive noise. 

The principal interest in the telegraph channels, therefore, is to exam- 
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ine how they fare in application over untreated N1 carrier. There have 
been some studies of this point. The general conclusion, to be elaborated 
below, is that although the frequency subdivision (and also the fre- 
quency shift) helps against impulsive noise, fewer telegraph channels 
may be used than over good non-compandored facilities; and at 
best, the transmission is accompanied by more distortion than expected 
in a telegraph link of the highest grade. However, a serious possibility 
for data transmission is indicated. 

2.4.1 Telegraph Tests on N1 Carrier 

Tests on this subject have been carried out by 8. I. Cory, J. M. Fraser 
and others and reported in unpublished memoranda. Before presenting 
the results of the tests, some background is necessary on the terms in 
which the results are reported. 

The performance is usually evaluated in tests of this kind in terms of 
a “maximum checkable” telegraph distortion over a short time (about 
5 minutes). The “telegraph distortion” represents the displacement 
from correct timing, of received signal transitions, after the initial 
mark-to-space transition in the “start”? element. These displacements 
are measured in percentages of the signal element duration. By ‘‘maxi- 
mum checkable”’ distortion is meant the maximum such displacement 
that is consistently reproduced in repetitions of the short testing period. 
This is somewhat larger than the root-mean-square distortion. A larger 
displacement is, of course, obtainable over a longer testing period. 
Although this measure of performance has had long use in the telegraph 
art, other measures are perhaps more readily grasped by and probably 
of more value to the data transmission engineer. Such a measure, for 
example, is the error rate. 

The error rate may be estimated through the use of the telegraph 
transmission coefficient. This is a figure which has been designed by 
telegraph engineers to indicate the performance of a telegraph circuit, 
particularly when it is made up of several sections. It is more or less 
proportional to the square of the distortion, and has the property that, 
when carefully chosen, it can be added for circuits connected in tandem. 
A small coefficient thus characterizes good transmission, and correspond- 
ingly, a large coefficient characterizes poor transmission. 

The correlation between peak distortion over 5-minute intervals and 
error rate, through the telegraph transmission coefficient, is indicated 
in Reference 8 and Table II. It is to be understood that the correlation 
is only a rough one. Particularly at the two extreme ends of the scale, 
the entries in the table can serve only as a general guide to the perform- 
ance, and the specific numbers are not to be taken too literally. 
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TABLE II — CHARACTERIZATION OF TELEGRAPH DISTORTION AND 
AND Error RATES BY TELEGRAPH TRANSMISSION COEFFICIENT 





Distortion Errors 

Transmission 

Coefficient 
RMS 5 min. peak 1 in # characters 1 in m bits 

(1) (mm) 

13.9 30 30 40 2.9 X 10? 
12.6 27 25 87 6.2 X 10? 
11.2 24 20 2.5 X 10? 1.8 X 10° 
9.8 21 15 1.5 X 103 1.1 X 10+ 
8.0 17 10 4.4 * 105 3.2 X 10° 
5.6 12 5 10° 7.4 X 10° 
4.3 9 5) 10!2 7.4 X 10! 
2.5 5 1 — — 


A very brief summary of some of the experiments in the use of VF 
telegraph over untreated N carrier is given in Table III. This portion 
of the results covers N1 circuits with compandors which have slightly 
more noise than the objective which is set for the telephone use of such 
circuits. The noise was 28 dba at the zero transmission level point, as 
against an objective of 26 dba at that point. It is noted in Section 3.3.2 
that the measurement of noise in these terms is not altogether reliable 
in the evaluation of its effects on transmission systems that use pulses. 
Thus, these experiments must be considered as giving only a general 
indication of the situation. 


TABLE III — SUMMARY OF TELEGRAPH PEFORMANCE OVER NOISY 
N-I CaRrrierR LINK 


40C] (AM) 43Al (FS) 

1. Number of channels........... 6 12 
2. Frequency space used......... 1020 2040 cycles 
3. Words per minute...:......... 75 75 
4. Total bits per second.......... 342 684 888 

_ Average Channel 
5. Peak distortion............... 16 8 18 per cent 
6. Estimated Transmission coeff. . 9 2.5 11.5 
7. Estimated errors, 1 in......... 108 1014 105 bits 


8. Peak distortion............... 25 17 22 per cent 
9. Estimated transmission coeff... 21 10 17 
10 i 


. Estimated errors, lin. ........ 1.5 X 108 4 xX 105 5 X 10° bits 
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The tabulation is first given for an average channel. The performance 
of the worst channel has, however, also been included to give an indica- 
tion of its contribution to over-all operation. 

Many of the N1 carrier telephone circuits in the plant show lower 
noise than the objective, and to this extent Table III is somewhat 
pessimistic. Also some of the tests have shown, particularly for AM, 
that the performance is somewhat improved by removing the com- 
pandors. Thus, allowing for both points, better performance can be 
expected from the average N1 circuit (less than 200 miles) in the plant. 
The transmission coefficient of 11.5 listed for Item 6 at 100 words per 
minute might go down to say 9. At 75 words per minute with FS or 
AM, it might not be over 4.5. It is clear, of course, that with further 
modifications of the N1 channels, such as to reduce noise exposures, 
better performance could be obtained. 

The broad conclusions that can be derived from these considerations 
are: | 

1. The subdivision of the frequency band into telegraph channels, 
and the use of FS, permit a workable system to be operated over a 
compandored facility like the N1 carrier without modification. This 
occurs even when the latter has noise up to and a little over the tele- 
phone objective. 

2. This workable system under such noisy conditions transmits up 
to some 350 bits per second with AM, and some 800 bits per second 
with I'S. It is accomplished with an error rate of the order: that has 
been implied for data transmission, even in the worst channel. 

3. There is a relatively wide range of performance of the system over 
different N1 circuits, and the average performance is sensibly better 
than that under the limiting conditions which have been considered. 

2.4.2 Drstribution of Signal in Allocated Bandwidth 

A more extensive discussion of the use of bandwidth is given in Sec- 
tion 3.1, below. However, a few specific points are appropriate here on 
the band use in telegraph channels. 

The spectrum of the original voice frequency telegraph system, based 


TABLE IV — Use oF FREQUENCY SPECTRUM IN TELEGRAPH CHANNEL 


AM FS 
1. Channel spacing.................. 170 170 cycles 
2. Nominal effective bands.......... 74 74 
3. Roll-off band (both sides)......... 37 26 
4 EM SWING oooh ae eee Fa OSes 0 70 
5 


. Guard band (both sides).......... 59 0 
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Fig. 3 — Utilization of telegraph channels. 


on 170 cycles between carriers, was conservatively developed for the 
60 word per minute speed of the time. The 100 word per minute speed 
has used up some of this conservatism. The use of frequency shift in the 
same channels has, however, used up the spectrum space even more. 

An outline of the band allowances is given in Table IV, and illustrated 
in Fig. 3. Item 2 of the table is based on the 100 word per minute speed, 
using double sideband. On this basis, the number is equal to the num- 
ber of bits per second. This is the minimum double sideband over which 
that number of bits can be transmitted, according to the Nyquist 
theory. Each such sideband is sometimes called a ‘nominal effective 
band.” In practice various allowances are necessary over this minimum. 

In. the first place a roll-off is necessary because filters are not infinitely 
sharp, and in addition the nature of the modulation itself forms a roll- 
off. Roll-off also leads to a signal which is more free of overshoots and 
generally ‘‘cleaner’ than when a sharp cutoff is used. Item 3 and Fig. 
3(a) and (b) show an allowance for roll-off. For the AM case this amounts 
to half the nominal effective band. For the FS case there is not quite 
that much space available. 

For the FS signal it is necessary to allow for the frequency swing 
as Item 4. For the 43A1 system this amounts to 70 cycles. In Fig. 3(b) 
the spectrum includes the region comprised by the FM swing, and up- 
per and lower sidebands. The upper and lower sidebands as formed by 
the modulation of a random signal are shown extending respectively 
above and below the extremities of the swing (instead of only above and 
below a central carrier, as they would with AM). 

A final allowance in Item 5 is a “guard band.” This is taken to mean 
a region in which the signal energy is negligible, but at the same time 
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a region in which no appreciable interference is tolerated from the ad- 
jacent band. The allowance for this is generous in the AM case. In the 
FS case the roll-off band of Item 3 tends to use up all the space not 
employed by the nominal effective band and the swing, and nothing is 
indicated as left for guard band. This is illustrated in Fig. 3(b) by the 
extremities of the roll-off band reaching the extremities of the 170-cycle 
spacing. It is to be recognized that the illustrations are diagrammatic. 
However, the extremities mentioned measure the bands occupied by 
power from random signal transitions between marking and spacing 
(as distinguished from mere mark-space reversals), analogous to the 
bands occupied by power from AM signal transitions. Comparison of 
Figs. 3(a) and 3(b) suggests how modulated signal components of signifi- 
cant intensity are displaced farther from the edges of the channel with 
I'S than with AM. 


DISTORTION IN PER CENT 


|| ef | 
pe 





SPEED IN WORDS PER MINUTE 


Fig. 4 — Experimental relation between telegraph distortion and_ speed, 
with fixed channel filters (after Jones and Pfleger’). 
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The conclusion is reached that there is hardly any excess conservatism 
in the 48A1 system. Some confirmation of this is indicated in a paper 
by Jones and Pfleger.® Fig. 4 reproduces some curves presented in that 
paper. The curve at (B) (from Fig. 5 of that paper) shows that the FS 
telegraph rises rapidly in distortion above the 100 word/min speed. The 
curve at (A) (taken from the same Fig. 5) for level-compensated AM 
shows a substantially broader and somewhat lower curve in this region. 
From curve (C) (taken from Fig. 3 of the paper) for diode modulated 
signals, it 1s seen that the sharp rise in distortion for FS is even more 
accentuated than for the relay modulator. This indicates how much 
more characteristic distortion IFS exhibits than AM because of the 
sharper roll-off of its signal bands within the confines of the same 170- 
cycle channel spacings. Of course, as the word speed is raised beyond 
the practically usable values, either type of modulation leads to so much 
power in the filter cutoff regions that the characteristic distortions be- 
come more or less indistinguishable. 


2.5 “Polytonic”’ System 


This is a frequency discrimination system experimentally proposed 
for toll and local signaling.‘ It works on 5 channels at speeds of 100 and 
300 decimal digits per second (or the equivalent of some 330 to slightly 
under 1,000 bits per second). It has some similarities to an earlier multi- 
frequency system,!® but is faster. 

The distinguishing characteristic of the polytonic system lies in the 
mathematical theory which has been followed to reduce interchannel 
interference. This analysis makes use of the theory of orthogonal func- 
tions, and is similar to that used in the computation of Fourier com- 
ponents. The mathematical analysis leads to an ideal receiver design 
for minimum. interchannel interference. The ideal detector in this 
receiver closely resembles the conventional homodyne detector. The 
detector actually used, however, represents a practical simplification of 
the latter. A complete description cannot be given here, but it may be 
noted that the theory leads to a need for synchronization of the signal 
elements in the five channels, and to the setting of an exact timing 
instant for the sampling of the received wave to obtain minimum inter- 
channel interference. The indication for this instant is obtained from 
the use of a sharp wave-front pulse in the marking channels. 

Tests with the 100 decimal digit per second system (100 decimal 
digits normally correspond to 332 binary digits) indicated that it gave 
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good operation! over toll circuits of comparatively limited length (350 
miles for four-wire voice frequency, 1,900 miles for K carrier). The higher 
speed system was designed only for local plant. 

It is clear that this system has too low a bit rate for the application 
contemplated, even in the faster form. It is also not generally adaptable 
to the variety of circuit lengths which are expected to be encountered. 


III. UTILIZATION OF THE TELEPHONE CHANNEL 


The discussion herewith covers a broad examination of some major 
characteristics of telephone communications facilities, to evaluate their 
bearing on the choice of a system for the data transmission service 
outlined before. It is of course clear that different conclusions might be 
reached for other types of service. 

The first item is an outline review of the different types of message 
telephone facilities in the plant. This is followed by an analysis of the 
different possibilities in the use of the frequency spectrum, of noise, and 
of delay distortion, in the application of the data signals. 


3.1 Telephone Facilities 


There is a rather wide variety of facilities to be found in the tele- 
phone plant, to be examined with respect to the factors that are ger- 
mane to the present question. 

The first of these factors is the frequency bandwith capability of 
the facility. For message-type voice circuits, this is generally character- 
ized as being three kilocycles (with the exception of “emergency banks,” 
which are substantially narrower, and are not to be considered as 
useable for data transmission).!” However, some of the telephone circuits 
in the plant, aside from the emergency banks, are also somewhat nar- 
rower than 3-kc, and in any case, not all the band is effectively usable 
for data transmission. As will be seen, the net available band is, in 
practice, about half of the 3ke. 

Part of the reason that not all of the frequency band is effectively 
usable is that the circuit shows delay distortion. This tends to become 
large at both the lower and the upper edges of the band. Some details 
of the delay correction are discussed further below. 

Another impairing factor in telephone facilities is the nonlinear dis- 
tortion encountered. In voice frequency facilities this comes from ampli- 
fiers and loading coils, and increases progressively with circuit length. 
In carrier facilities the nonlinear distortion arises almost exclusively at 
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carrier terminals, in the part of the circuit where the signal is at voice 
frequency. In such a case, the distortion increases with the number of 
times in the telephone facility that the signal is modulated down to 
voice frequency. Second order nonlinear distortion tends to develop 
modulation products in the lower portion of the transmission band 
which are a source of potential interference with the signal. 

Another impairment encountered in carrier telephone channels is a 
slight frequency shift; that is, a 1,000 cycle input may appear at the 
output, say at 998 cycles. This occurs because modulator and demodula- 
tor frequencies are not identical. With independent oscillators on recent 
systems this shift may amount to some two cycles. With older systems 
it can run from 5 to 10 times as much. The effects of this shift are dis- 
cussed below. The frequency shift may be avoided by working double 
or vestigial sideband and using an envelope detector, or in some carrier 
systems by locking the oscillators in a constant frequency network. This 
locking may or may not result in close phase synchronization of the 
carriers, depending on the method used to lock and the particular carrier 
system involved. | 

Still another factor is the use, or not, of ‘‘compandors.”’ A compandor 
compresses the range of speech volume in the impressed line signal and 
correspondingly expands this range at the receiver. This raises the line 
signal level during periods of low speech power, and lowers it during 
periods of high speech power without, in principle, affecting the final 
received level. The effect is to reduce the final noise in periods of low 
speech power, and increase it during periods of high speech power. A 
listener is less perceptive to the noise during high speech power levels 
than low. By this means, it has been found that the telephone circuit 
can be engineered to some 23 db more noise (and also crosstalk and simi- 
lar forms of interference) than it can without the use of a compandor. 

In the case of data signals, however, the influence of noise in causing 
error is not very much different whether the signal is marking or spac- 
ing. Thus, there is no ‘‘compandor advantage’’* (indeed there is a cer- 
tain disadvantage as pointed out earlier), and facilities that have been 
engineered to be entirely satisfactory for voice transmission are effec- 
tively some 23 db more noisy for data transmission. As a practical mat- 
ter it appears desirable to remove compandors from circuits used ex- 
clusively for data. 

A short listing 1s presented here of the various types of message facili- 

* Perhaps a simpler way to think of it is that all possible “compandor advan- 


tage’’ has already been obtained in a data system by using the best combination of 
amplitudes for mark, space, start, etc. 
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ties most frequently found in the telephone plant, and some comments 
are made on each. 

3.1.1 Voice Frequency Circurts 

There is a variety of open-wire facilities of this type. They are mostly 
short, and two-wire. Thus, repeaters can to advantage be turned one-way 
for data service. Delay correction 1s discussed later. 

Voice frequency cable facilities over more than a very short distance 
are loaded. This gives appreciable delay distortion. The loading used 
is indicated by a letter denoting the spacing, followed by a number 
denoting the loading coil inductance. Thus “‘H-44”’ means 6,000-foot 
spacing, of 44-millihenry coils, and ‘“B-88’’, 3,000-foot spacing of 88- 
millihenry coils. Conductor capacities range from 0.62 microfarads per 
mile for toll circuits, to 0.82 microfarads per mile or sometimes even 
higher for local circuits. This affects the delay distortion. 

3.1.2 Type-C Carrier Circuits'® 

This is an open-wire three-channel system operating at different 
frequencies in opposite directions, over the same pair. Historically there 
has been a variety of C systems developed, but only the C-5 system 
exists In any extensive quantity. The upper frequency cutoff in the 
voice channel is well under 3 ke. The delay distortion varies widely 
with the specific channel and direction of transmission. There is a 
variety of channel frequency allocations, and the distortion varies 
with this also. The delay distortion over some channels increases rapidly 
above 2,400 cycles. The frequency shift discussed before may be as 
much as +20 cycles. . 

3.1.3 Type-N Carrier Circutts® 

This is a short-haul twelve-channel system for use over cables. Be- 
cause of its economy it has been extensively introduced. Its principal 
characteristic, in the application of data circuits, 1s that it uses com- 
pandors. It therefore presents a noise problem. The delay distortion, 
introduced almost exclusively by the terminals is not excessive, and 
depends very little upon circuit length between the terminals. The N 
system uses double sideband transmission, and therefore exhibits no 
frequency shift between input and output signals. 

3.1.4 Broadband Carrier Systems Using A Channel Banks}® 

There is a variety of carrier systems designed for paired cable, coaxial 
cable, open wire, and radio, that use a standard grouping of twelve 
channels with associated filters, known as an ‘‘A channel bank.’”’ The 
delay distortion in these associated filters constitutes nearly all of the 
distortion measurable over the complete system. These are single side- 
band systems, and unless the local modulator and demodulator carrier 
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supplies are locked in a constant frequency network, frequency shifts 
of some -&2 cycles may be expected between input and output. 

For paired cables, these are known as K1 and K2 systems. For coaxial, 
they are L1 and L38, for open-wire, J, and for microwave radio, TD-2. 

3.1.5 Other Broadband Carrier Systems 

An O carrier system has been developed for open wire, and combina- 
tions of it are used with N for open-wire and carrier. These are com- 
pandored systems. 


3.2 Use of Bandwidth 


This section examines the more important factors which affect the 
choice of how the available bandwidth of a facility is to be used, either 
in one band or a subdivided band. 

3.2.1 Baseband Transmission 

This is the simplest type of transmission. It is used in telegraph loops 
and other short distance telegraph transmission. A mark is indicated by 
placing marking voltage across the wire line, and a space by placing 
spacing voltage. In the simplest systems the latter is zero. In “polar” 
systems it is the negative of marking voltage. 

The frequency spectrum of the signal runs down to and includes de, 
as illustrated by the solid lines in (a) of Fig. 5. 

With many transmission facilities it is difficult or impossible to trans- 
mit the de; 1.e., the circuit cuts off as is illustrated by the dotted lines. 
In such cases it is impossible to distinguish between a permanent mark 
and a permanent space. 

Extra pulses can, however, be added to the signal to insure that marks 
or spaces are not permanent, but are relieved by the opposite signal in 
some maximum interval of time. In such cases the received signals can 
be clamped on mark or space signals and the opposite condition can be 
readily distinguished. This is sometimes called ‘‘de restoration,’’ and 
strictly speaking the system ceases to use baseband transmission. It 
may be designated as ‘“‘modified baseband transmission.’’ Methods other 
than clamping have been suggested for de restoration. 

Reverse pulses can be systematically inserted after each mark or space 
pulse, according to various patterns.!! Two suggested are ‘“‘dipulse’”’ and 
“dicode”’ pulses. Such signals approach carrier signals, which are dis- 
cussed below. | 

The principal weakness of baseband transmission appears when it is 
sent over C carrier or other single sideband telephone facilities, where the 
recovered signal may vary in frequency from that sent. This causes a 
distortion of the received pulse which makes it difficult to recognize. 
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An analysis of this point is given in Appendix I. It is concluded that 
while there may be long range possibilities in baseband transmission, it 
requires more study, and it will not be considered further in this paper. 

3.2.2 AM Carrier Transmission 

The simplest of this type is double sideband transmission, as illus- 
trated by the full line of Fig. 5(b). 

A comparison of the susceptibility to noise of this arrangement, with 
that of baseband transmission, is considered in the next section. 

A further consideration required is susceptibility to nonlinearity in 
the facility. Second order modulation leads among other things to a recti- 
fication of the signal back to baseband. This is indicated by the dotted 
lines in Fig. 5(b). After such rectification of the signal by the facility, it 
is impossible to separate any overlapping portions of the signal between 
the baseband and lower sideband. Some overlap is shown. This inter- 
ference was first considered in telephotography” and is known as ‘“‘Ken- 
dall effect.” 

The possibility of Kendall effect may be eliminated insofar as second 
order modulation is concerned by moving the carrier frequency high 
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Fig. 5 — Spectra of signals with various modulations. 
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enough to prevent such an overlap. This is indicated in Fig. 5(c). It does 
not prevent third order modulation effects. 

It has been found necessary to allocate frequency bands thus to avoid 
the overlap discussed in the transmission of high grade telephotography 
over telephone type facilities. It has also been noted that allocation with 
such overlap was undesirable in some data transmission experience. 
However, the question has not been resolved in complete detail. For the 
data service under consideration, which is expected to show a very low 
error rate, it is deemed conservative to allocate bands without the over- 
lap. 

This conclusion then leads to wasting a certain part of the lower fre- 
quency range. It is still possible, however, to use this range for an auxil- 
lary signal, as in the system illustrated in Fig. 2, if the auxiliary signal 
occurs only during word starts. 

The double sideband frequency range, as indicated in Fig. 5(b) or 
5(c), is about twice the baseband range of Fig. 5(a). It is possible to re- 
duce this extension by cutting down one of the sidebands to a ‘‘vestige’’ 
of itself and sending carrier at a reduced level, as indicated by the diag- 
onal dotted line about the carrier in Fig. 5(c). This was proposed by 
Nyquist.% It is done at the expense of an increased vulnerability to 
noise, which in total amounts to some 5 or 6 db incertain typical cases.” !! 
In Section 2.1.3 discussion was given to account for 3 db of this. In the 
references cited herewith it is noted that vestigial sideband transmission 
is accompanied by an interfering component (called a ‘“‘quadrature com- 
ponent’) which accounts for the other 2 or 3 db. 

3.2.3 PM Carrier Transmission 

Certain additional immunity to noise is gained by the use of frequency 
modulation (or “frequency shift’’) of the carrier. The immunity which 
can be obtained against impulse noise can be even greater than that 
against random noise, provided that the receiver is precisely tuned. This 
was noted in Section 2.4 in connection with voice frequency telegraph. 

The noise immunity obtained from FS is in part due to the use of a 
higher average power and is at the expense of a wider frequency band 
as illustrated in Fig. 5(d). In addition to all the band that is used for a 
double sideband system, a space must be allowed for the swing. FS is 
also much less vulnerable to sudden level changes than DSB and thus 
may well be preferable to DSB for medium bit-rate service. As shown in 
Table I, these advantages can be obtained with only a small sacrifice of 
bit rate compared to DSB for equal bandwidths. | 

So far the single channel broadband FS system has not been generally 
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used over wire circuits. Hence its exact performance, particularly with 
impulse noise, is only estimated. 

On occasion not all of the band illustrated in Fig. 5(d) is allowed for. 
This leads to an increase in distortion of the signal, which has some simi- 
larity to a very close-in echo. It uses up some of the additional noise 
immunity provided by the FM, as an engineering compromise. 

Another direction along which the FM system may be practically 
extended is to use four instead of two (marking and spacing) frequencies. 
This would double the bit capacity at the expense of only a moderate 
widening of the frequency band and somewhat tighter requirements on 
noise and delay distortion (but not of level regulation, which would be 
required for a similar extension of the AM signal). 


3.2.4 Multichannel Systems 


It is possible to divide up the entire frequency band available into a 
number of separate channels and use any one of the various carrier sys- 
tems which have been described, in each individual channel. This may be 
done because the nature of the information transmitted may be better 
adapted to the narrow channel, as in conventional telegraph. It permits 
certain elements of flexibility in layout, and offers certain noise advan- 
tages (and also disadvantages) as discussed in the next section. 

In an idealized way one can proportion the various allowances for 
nominal effective band, roll-off, guard band, and swing (FS) in the same 
proportion in which they would occur in a single broad channel over the 
whole facility. Thus no frequency space would be lost by the subdivision. 
In practice, however, subdivision usually does lead to some actual loss 
in the frequency space. 

A significant limitation to frequency subdivision lies in nonlinearity of 
the facility. This leads to modulation products between the various 
channels, which interfere with other channels. 

In the case of voice frequency telegraph and other multiple channel 
systems the modulation effects are mitigated by allocating the carriers 
at odd multiples of a basic frequency. That is, any given carrier f 1s set 
at f = nk, where n is odd and k is a basic figure. Then the three second 
order modulation products are 


2f = 2nk, 
fi + fe = (m1 Os N2) ie 
fi — fe = (m — ne) k. 
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TABLE V — Uss or TELEPHONE BAND BY VARIOUS DATA SYSTEMS 








System Modulation ae f p ee Total Bits/Sec. 
1. Proposed................ VSB 1 1600 1600 
2. Exploratory. ............ VSB 1 1650 1650 
3. Exploratory. ............ Bee 1 750 750 

- ftoll.... 2.2... SB 5 100 500 
4. Polytonic eer eee DSB 5 300 1500* 
5. 40C]l Teg. .........0.0.... DSB 18T 74Tt 1332tt 
6. 48Al Teg. ............... FS 17T 74tT 12581 tT 


* Not realizable with 2 out of 5 codes used. 
+ Not realizable over some facilities. 
tT Based on 100 word per minute channel capability. 


All these products are necessarily even multiples of k and-therefore al- 
ways fall half way between carriers. This allocation, however does not 
permit mitigation of third order modulation effects. 

3.2.5 Hapertence 

Table V reviews systems which have been discussed earlier in this 
paper, to indicate the extent to which they use a general telephone 
channel facility, in terms of the bit rate output. It is clear, of course, 
that the various systems are not engineered to the same conservatism. 
These differences have already been commented upon. 

The general conclusion which one can reach from the table is that the 
use of the medium in a single channel gives possibilities of a higher bit 
rate than subdivision. However, it is to be kept in mind that the tele- 
graph facilities are conservatively engineered. Further as noted, they 
can be used to the full extent indicated only over the broader band tele- 
phone channels. For example, the full 18 telegraph channels can not be 
used over a C carrier telephone channel. 


3.3 Norse 


A general theory regarding the influence of noise on digital systems 
was presented in 1948 by Oliver, Pierce and Shannon.“ This was dis- 
cussed further at a symposium.’ Several additional points are considered 
here, relating to the effect of noise on a data transmission service of the 
type contemplated. 

3.3.1 Effect of Channel Subdivision on Vulnerability to Noise 

This has been suggested earlier at several points. A more detailed 
discussion is given of the effects in Appendix II. 

The conclusions reached there are, broadly: | 

1. Channel subdivision has comparatively small effect on vulnera- 


1478 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1957 


bility to random noise. The small effect which does occur is a disadvan- 
tage which can run up to some 3 or 4 db for ten or twelve channels, for 
the multichannel as compared to the single channel system. 

2. Channel subdivision is advantageous over single channel use, with 
regard to vulnerability to impulse noise. 

3. Channel subdivision is disadvantageous compared to single channel 
use, with regard to vulnerability to single frequency noise. 

3.3.2 Noise Effects in Vestigial Sideband System 

Some brief discussion of the noise problems in the vestigial sideband 
system under consideration has already been presented in Sections 2.1.3 
and 3.1. That such problems may be important in the use of actual 
telephone facilities has generally been checked by some unpublished tests 
carried out by J. Mallett, of Bell Telephone Laboratories. 

The noise problem is most serious in the application of data transmis- 
sion over N1 carrier. It is particularly significant for the N1 installations 
previous to the most recent. The most recent installations are engineered 
with distinctly more conservatism in regard to noise performance. As is 
noted in Section 3.1, the principal characteristic of the N1 system that 
affects this application is that it uses a compandor and that its design 
for telephone use assumes a reduction of the noise by this compandor. 
The reduction then is not realized for data signals. A second characteris- 
tic is that N1 channels are exposed to impulse noise. The channels are of 
course designed to limit such noise to the extent that it will not sensibly 
impair telephone speech. But short data pulses are more vulnerable to 
the impulse noise than speech. The 2B noise meter, which is normally 
used for telephone noise measurements, does not read sharp noise im- 
pulses according to their effect on data signals, and other methods of 
measurement have been explored. 

A summary of some of Mallett’s results is plotted in Fig. 6. Here the 
reading on the 2B meter is compared with that on a level distribution 
recorder which records peaks of 1 millisecond or longer. The “‘one per 
cent”’ point is noted, which means that one per cent of the one second 
intervals in the period of measurement contained one or more peaks (of 
1 millisecond or longer duration) of the amplitude indicated. This is 
found convenient as a measure of the error frequency tolerated in the 
system considered (one in 10° bits). 

The heavy dots indicate the correlation between the two sets of noise 
readings on idle tested channels of an operating N1 system (other chan- 
nels inthe system being busy due to normal use). All channels had com- 
pandors in. Dots that mark the boundaries of a group of tests (usually a 
single system in the group) are connected by straight lines. The open 
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Fig. 6 — Noise measurements in N1 carrier facilities. 


dots refer to an estimate (computed from the known properties of the 
compandor) of what the noise peak levels would have been, had the 
channel tested been busy with an operating data transmission signal (of 
the general type illustrated in Fig. 1). Under this condition the com- 
pandor setting would of course be different from that of an idle channel. 
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In such cases the 2B meter reading is also adjusted to the effective mes- 
sage circuit noise which would have existed in the presence of speech on 
the tested channel. 

Open triangles connected by fine dotted lines indicate summary plots 
for the idle channels, and for the busy channels. 

Examination of the plot shows that there is only a general correlation 
between the 2B set and level distribution recorder readings, as sum- 
marized by the dotted lines. Thus the 2B meter is not a reliable instru- 
ment to denote how noisy a circuit is for data transmission of the speed 
used in the proposed project. 

The telephone objective for N1 circuits, as read on the 2B meter, is 
indicated by the vertical dotted line. This shows that most of the cir- 
cults (actually 55 out of 62) met the objective. 

The suggested requirements for data are shown by the horizontal 
dotted lines. The “through” or noncompandored channel has a 3 db more 
lenient requirement (as indicated during some of the tests) than the 
compandored one. This results from the penalty mentioned earlier which 
compandors impose on in a data circuit, as compared with. the 23 db or 
‘so advantage that it introduces for voice transmission. Only a few of the 
circuits measured (actually 10 out of 62) met the suggested require- 
ments. 

The principal conclusion reached from these measurements is that 
where used for a data service of the type considered, with vestigial 
sideband transmission, most of the hitherto installed N1 circuits (and 
probably other compandored circuits) will require modification to reduce 
noise exposure. Also noise measurement will be more complicated for 
such a service than for telephony. 


3.4 Hnvelope Delay Distortion 


Some simple theoretical considerations’ have shown that the envelope 
delay distortion limits for a telephotograph circuit generally also hold for 
a data transmission circuit of the same speed. The principal difference is 
that less emphasis need be placed for data circuits upon fine structure 
deviations of the envelope delay as plotted on a frequency scale. In 
general, distortion of +0.4 signal element in the important part of the 
band has been found to give a signal-to-noise impairment of some 3 db 
in signal reception. This has been assumed here as a tentative engineer- 
ing objective. | 

In accordance with this, the envelope delay requirements for service 
with the vestigial sideband signal consideration have been set at not to 
exceed 500 microseconds (-+250 microseconds) between 1,000 and 2,500 
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cycles. This contains an element of conservatism inasmuch as the strict 
requirement is really fully implied only on the nominal effective band 
(1,200-2,000 cycles). The signal power is reduced in the roll-off and 
vestigial bands, respectively 1,000-1,200 and 2,000-2,500 cycles, and 
some corresponding liberality may be expected there. 

The delay distortion constitutes a more serious problem with a faster 
system as compared with a slower one, in part because of the wider 
frequency band occupied by it, and in part because 0.4 signal element 
represents a more severe tolerance in microseconds for a shorter element 
than for a longer one. Consequently, the limits given represent about as 
severe tolerances as may be expected to be needed with the use of a 
telephone channel. 

The distortions of various circuits have been considered to estimate the 
order of the problem involved in meeting the proposed requirements over 
links of 100 to 500 miles. 

The following conclusions are reached first for the vestigial sideband 
signal, and after this for the slower systems. 

3.4.1 Facthties Requring No Treatment 

As already noted, K2, L1, and L3 carrier, and TD-2 microwave, use 
‘‘A” channel banks to separate the individual channels, and these give 
the dominant delay distortion. This amounts to a maximum of about 
200, and a minimum of 150 microseconds, according to the exact com- 
bination of filters used. This figure is for one link of transmitter and 
receiver. A single section delay equilizer can cut the maximum residual 
to about 80 microseconds. It is concluded that these facilities present no 
important delay distortion problems. An N1 carrier link gives a maxi- 
mum delay distortion of 220 microseconds, which can be reduced to 50 
microseconds by one section of equalizer. This, then, also presents no 
serious problem. 

3.4.2 Facilities Treated by Simple Prescription 

The delay distortion of H-44 voice frequency cable in the 1,000- to 
2,400-cycle range runs to slightly under 900 microseconds for 300 miles, 
if the cable is of standard toll capacitance (0.062 mf per mile), and to 
slightly under 2,000 microseconds if of higher local plant capacitance 
(0.084 mf per mile). The use of about one section of equalization per 100 
miles reduces the residual to less than 330 microseconds for the low 
capacitance cable. For the higher capacitance cable, about three sections 
are needed per 110 miles. The J-2 carrier uses A channel banks, but has, 
in addition, directional separation filters at each repeater. This gives 
maximum and minimum distortions, respectively, for 100 miles, of 
slightly under 300 and slightly under 160 microseconds. The precise 
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distortion in any given channel depends upon its proximity to the cut- 
off of the directional filter. For 500 miles the figures are slightly over 500 
and slightly under 50 microseconds. With the same single section of 
equalization the maximum figures are reduced to about 100 microseconds 
for 100 miles, and about 300 microseconds for 500 miles. To carry out 
this equalization requires only rudimentary information on the general 
nature and correction of delay distortion. If moderate care is used in the 
prescription of equalization on a packaged basis no delay measurement 
of the circuit would in general be needed, though it is recognized that 
some difficult cases may arise. 

3.4.3 Facilities Requiring More Involved Prescription 

The delay distortion in C-5 carrier'® is influenced to a dominating ex- 
tent both by channel and directional separation filters. It varies in a 
complex fashion from channel to channel, and according to the direction 
of transmission. Its correction thus requires more involved prescription 
than is required for the other types of circuit. In some few cases measure- 
ment may be necessary. The distortion of H-88 voice-frequency cable 
runs from some 1,400 to over 3,000 microseconds per 100 miles accord- 
ing to capacitance. For 20 miles of H-174 toll cable the distortion is 
slightly under 1,400 microseconds, and its use is not contemplated. 

3.4.4 Data Systems Requiring No Corrections 

The delay distortion problem is practically non-existent for the slower 
systems. For the double sideband systems some delay correction may be 
needed if long heavily loaded circuits are used or perhaps for some other 
rare unfavorable situations, but otherwise no correction 1s necessary. 
No correction is needed for the telegraph systems. 


APPENDIX I—~ BASEBAND SIGNAL DISTORTION CAUSED BY CARRIER 
FREQUENCY SHIFT 


A simple analysis of the phenomenon may be considered. Let the 
voltage input, asin Fig. 7(a), be a raised cosine pulse between the angular 
arguments of —z and +7. That is 


V; = 1+ cos 0, (1) 


where Q/7 is the envelope frequency. When this is transmitted on the 
carrier, cos wi, the carrier signal voltage is 


Ve. = (1 + cos 2) cos at, (2) 
= cos wt + 4 cos (w — Q)t + 4 cos (w + QE. (3) 


When the carrier and one sideband are removed (say the lower side- 
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band),V. becomes (neglecting the factor 4) 
V. = cos (w + QE. (4) 


_ At the receiving end, V. is modulated with a carrier which may mo- 
mentarily differ in phase from the signal carrier by angle ¢, giving 


Vo = cos (w + Q)t cos (wt + ¢), (5) 
= $608 [(w + O)t — (wt + ¢)| + 3 cos [( + QO)t + (wt + ¢)]. (6) 


The lower frequency part of V, constitutes the recovered signal, and 
it is extracted by a filter that attenuates the higher frequency part. Thus, 
again neglecting the factor of 3, 


V, = cos (wt + Qt — wt — ¢), 


7 
V, = cos (Q¢ — ¢). ") 
(a) y (f) 
0 
-77 0 7T 
(b) (9) 
ae T 
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(c) (h) 
TF 
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s 


(e) (J) 


; 


Fig. 7 — Distortion of baseband pulse signal in single sideband carrier facility. 
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The angle ¢ progresses through 0 to 27 per cycle of difference between 
the original and recovered frequencies. That is, if this difference is 2 
cycles, then ¢ progresses from 0 to 27 twice each second. 

The effect of the progression from 0 to 7 is illustrated in Figs. 7(a) to 
7(e). When the phase is equal to z, as at 7(e), the signal is ‘‘upset”’; i.e., 
marks are changed to spaces, and vice versa. When the phase is equal 
to 7/2, the signal is effectively differentiated, as at 7(c). 

In general, the signal as specified in (1) includes harmonics of 2. An 
illustration of such a signal several dots long is shown in Fig. 7(f). As . 
before, when g = 7, the complete signal is upset, as illustrated in Fig. 
7(j). When ¢ = 7/2, as at 7(h), the signal is more or less differentiated. 
The differentiation is not exact because the successive harmonics are not 
weighted according to order, as in true differentiation. 

The distortions caused by the progression of g are what make it diffi- 
cult to recognize the signal at the receiving point. Some suggestions 
have been made for correcting the indication when the signal is upset, 
as in Figs. 7(e) and 7(j). It is more difficult, however, to take care of the 
intermediate cases, particularly 7(c) and 7(h). 


APPENDIX II ——~ EFFECT OF CHANNEL SUBDIVISION ON VULNERABILITY 
TO NOISE j 


There are three broad categories of noise to which the system may be 
exposed. These are: 

1. Random noise which is not localized in time nor frequency. 

2. Impulse noise which is highly localized in time but covers a broad 
frequency spectrum. 

3. Single-frequency noise, which is highly localized in frequency but 
which lasts a significant time (or substantial number of bits). 

We can assume two systems, A used as a single-frequency band, and 
B divided into ten channels. Correspondingly, therefore, signal pulses of 
unit duration over system A, are of 10 units duration in each channel of 
system B. 

Random noise having uniform spectral distribution, and power W in 
each channel of system B, cumulates to power 10W in system A. Signal 
power P in each channel of system B cumulates to 10P for the total 
system. If signal power 10P is used in system A, the two systems are at 
a stand-off in signal-to-noise ratio for this type of noise. 

In practice, the power capacity to handle the signals for system B must 
be made a few db higher than indicated by 10P to allow for occasional 
peaking caused during instants of unfavorable phasing among the vari- 
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ous channels. Thus, the multichannel system B is really worse off, by 
that small amount, than system A. This may amount to some 3 or 4 db 
in ten or twelve channels. 

There are occasions where crosstalk into other facilities sets the level 
permitted for the signal. It may be that concentration of the signal onto 
a single carrier aggravates this interference, as compared with that from 
a multi-carrier signal. In such cases system A may be penalized by a few 
db, as compared with system B. 

Impulse noise shows correlation among the phases of its spectral 
components. Thus a noise pulse of voltage amplitude N in each channel 
of system B, cumulates to voltage.amplitude LON in system A or 20 db 
greater. On the other hand, a signal of amplitude S in each channel of 
system B, cumulates to an RMS voltage amplitude 7/10 S, or 10 db 
greater, over the total system (plus a peaking correction which may be 
positive or negative as just mentioned). Thus, the single channel A sys- 
tem is at a disadvantage of 10 db, less the peaking adjustment, with re- 
spect to the B system. 

A further adjustment may be needed, because a single noise peak that 
affects all 10 channels of B, or 10 bits, may affect only one bit of A. 
This adjustment depends upon the word grouping of bits which is used. 
It may be neglected if all the 10 bits of B are in the same word, and if, 
in one word, an error of 10 bits is effectively no worse than an error of 
1 bit. 

Single-frequency noise lies at the opposite extreme of the gamut from 
impulse noise. The vulnerability of a signal pulse to single-frequency 
noise varies according to the relationship between the frequencies of the 
noise and of the signal carrier in the utilized signal band. | 

The pattern of sensitivity to noise over an individual channel can be 
expected to be about the same for a narrow band as for a wide-band 
channel. Thus the pattern of sensitivities in the single channel of system 
A is repeated in each channel of system B on a 10 times finer frequency 
scale. The required S/N ratio in any one channel of B remains the same 
as that for A. 

In system B each of the ten channels must put out only one tenth of 
the power of the single channel of system A (less the correction for peak- 
ing which as before may be positive or negative). Thus any one of the 
ten channels of B is 10 db (plus a peaking correction) more vulnerable 
to single frequency noise than the single channel of A. 

It must be noted that there are occasional special circumstances where 
the single frequency noise may be persistent and steady. The multi- 
channel system B may in such cases have an advantage in permitting 
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the one channel affected to be dropped, and the others to be worked 
entirely free from this interference. This of course reduces the total bit 
rate. 

To summarize the discussion in a general philosophical way, it can be 
said that there is advantage in multiplexing the signal in the manner 
that makes it as different as possible from the type of noise to which it 
is expected to be the most exposed. If the predominant noise is in short 
duration pulses, the most advantageous signal is in long duration pulses 
with frequency discrimination multiplex. If the noise is in longer dura- 
tion single frequencies, the most advantageous signal is in very short 
pulses with time discrimination multiplex. 
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Design, Performance and Application 
of the Vernier Resolver" 


By G. KRONACHER 
(Manuscript received May 29, 1957) 


The Vernier Resolver 1s a precision angle transducer which, from the 
stand-point of performance, resembles a geared up synchro resolver, except 
that the step-up ratio between the mechanical angle and the electrical signal 
is obtained electrically. 

Vernier resolvers with step up ratios of 26, 27, 32 and 33 have been de- 
signed and burlt. 

The unit 1s a reluctance type, variable coupling transformer. By placing 
all windings on the stator, sliding contacts are eliminated. Both the stator 
and the rotor are laminated. Because of the averaging effect inherent in a 
laminated construction, the accuracy of the unit exceeds by many times the 
machining accuracy. 

The performance of present experimental units is characterized by a re- 
peatability of better than +3 seconds of shaft angle, and a standard devia- 
tion error over one full revolution of less than 10 seconds of arc. 


I. INTRODUCTION 


The precise measurement of an angle is a basic operation in many 
technical fields. The observation of stars, mapping of land, machining 
in the factory are all operations which require angle measurements. Of 
course, an angle can be measured by reading a calibrated dial. However, 
in automatically controlled operations the angular position of a shaft 
_ has to be sensed electrically. The instrument which performs the con- 
version from a mechanical angle to an electrical output is called an angle 
transducer. One commonly used angle transducer is the synchro resolver. 

Basically this is a variable coupling transformer with one primary 
winding and two output windings displaced 90 degrees from each other. 
The variable electrical coupling is accomplished by placing the primary 


*The Vernier Resolver was developed under the sponsorship of the Wright 
Air Development Center. 
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winding on the rotor and the secondary windings on the stator or vice 
versa. The primary winding is excited from an alternating voltage 
source of, say, 400 cycles per second. The amplitudes of the induced 
secondary voltages of the synchro resolver are ideally proportional to 
the sine and cosine of the rotor orientation. These two induced, ampli- 
tude modulated voltages are the resolver output. 

The accuracy of commercially available synchros is, at best, three 
minutes of arc. Certainly, this accuracy is sufficient for many applica- 
tions. In the machining of precision parts and in field applications in- 
volving the measurement of elevation and azimuth of distant targets, 
_ however, accuracies down to 10 seconds of arc are required. 

One might be tempted to try to meet this requirement by merely re- 
fining the present standard synchro. However, even if this refinement 
were possible, it still would be a difficult task to transmit this near-per- 
fect synchro output and also to convert it into other analog forms with- 
out losing most of the added accuracy because of noise in the system. 
The transmission and conversion problem can be side-stepped by going 
to a so-called ‘‘two speed” or ‘‘vernier’’ representation of the angle. This 
representation is obtained by using two synchros; one, the low speed 
synchro, is positioned directly to the particular angle and the other, the 
high-speed synchro, is geared up with respect to the former. The angle 
is now represented by two synchro outputs. Assuming perfect gears the 
accuracy of this system is improved by the step-up ratio in the gearing. 

This approach has been adopted in the past, but unfortunately, it has 
major disadvantages to it. First, precision gears of better than one min- 
ute of arc are expensive, relatively large and of limited life due to wear. 
Second, considerable torque is required to overcome the gear friction and 
the inertia effect of the high speed synchro. For these reasons, it is de- 
sirable to replace the geared-up synchro by a transducer which performs 
the step-up between input and output electrically. The vernier resolver 
is such an angle transducer. 

The unit is a reluctance type, variable coupling transformer. By 
placing all windings on the stator, sliding contacts are eliminated. Both 
the stator and the rotor are built up of laminations. The step-up ratio 
is equal to the number of teeth on the rotor lamination. Prototype units 
have been built with step-up ratios of 26, 27, 32 and 33. The accuracy 
of these units is characterized by a standard deviation error of less than 
10 seconds of arc. This high degree of accuracy is due largely to the aver- 
aging effect inherent in a laminated construction. The unit may be re- 
garded, simply, as a device which senses the average orientation of 
all rotor laminations with respect to the stator. Because of the great 
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number of laminations (one hundred in the present units) the effect of 
individual imperfections in laminations is greatly reduced. 

In preparation for a close study of the vernier resolver we shall describe 
the performance of an ideal unit, and also introduce some technical 
terms. The output of the vernier resolver consists of two amplitude 
modulated voltages one of which is called the sine-voltage and the other 
the cosine voltage. The amplitudes of these voltages are proportional to 
the sine and cosine of ‘‘n’’-times the rotor orientation. The factor ‘‘n”’ 
which, of course, is a function of the rotor configuration will be called 
the order of the resolver. We shall call the arctangent of the ratio of the 
secondary voltages — sine-voltage over cosine-voltage — the ‘‘signal- 
angle’. Furthermore, to define a positive sense of rotation and to make 
the signal-angle definition unambiguous, we shall assume that, with 
continuous positive shaft rotation, the signal-angle runs through a se- 
quence of cycles, each going from zero to 360°. Thus, one signal-angle 
cycle corresponds to a shaft rotation of (1/n)th; of one revolution. This 
angular interval is called the ‘‘vernier”’ interval. 


II. DESIGN PRINCIPLES 
2.1 A Simplified Description 


Fig. 1 represents a simplified model of a third order vernier resolver. 
The unit consists of a laminated rotor with three equally spaced teeth 
and a laminated 4-pole-shoe stator. Each pole-shoe bears one exciting 
coil (not shown in the figure) and one output coil. Successive exciting 
coils are wound in opposite directions, connected in series, and energized 
from an ac source. Thus, successive pole-shoe fields alternate in phase. 
The two output windings each consist of two diametrical output coils 
connected in phase opposition. 

If the rotor were a circular cylinder, the net voltage in either output 
winding would be zero. However, because of the three rotor teeth, the 
induced voltage of either output winding goes through three identical 
cycles per rotor revolution. Consequently, the amplitude EF, of the in- 
duced cosine-voltage e, can be represented as a Fourier series of three 

times the shaft angle @, , 


Ei, = Ey, cos (86m) + Es cos [8(88m)| + ---, (1) 


where 4; , #3 are the Fourier components of /, with respect to (86m). 

The series is free of even harmonic terms because of the symmetry be- 
tween positive and negative half-cycles. The expression for the ampli- 
tude H, of the sine-voltage e, 1s obtained by substituting [6, — (a/6)] 
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for Om in (1); 


The magnitudes of the Iourier components depend on the design 
details of the unit. In a properly designed unit all higher order Fourier 
components are sufficiently small so that the induced signal voltages 
closely approximate those of an ideal third order resolver as expressed 
by the following equations: 


E, = KE, cos (86m), | (3) 

i, = Ey, sin (84m), (4) 
_1 Hs 

és = tan “7 30m (5) 


where 6s is the signal-angle. 


2.2 Analysis of Practical Case 


Tig. 2 showsanassembled unit, and typical stator and rotor laminations 
are illustrated in Figs. 3 and 4. | 


Owé — —-— —---—-----~— @-—------ —-——- 0 





Fig. 1 — Schematic of a 3rd order vernier resolver. 
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Fig. 2 — View of the assembled vernier resolver and its rotor. 


The stator lamination is of ten pole-shoes, each pole-face having 
three teeth. All 30 teeth of the stator are equally spaced. The number 
of equally spaced teeth of the rotor lamination is equal to the order “‘n”’ 
of the resultant vernier resolver. 

The exciting winding, as in the simplified model, produces ac mag- 
netic fields alternating in phase from one pole-shoe to the next. Each of 
the two output windings is distributed over all ten pole-shoes. 

To describe the turns distribution of these windings it is necessary 
to define the positive winding sense and the electrical angle of a pole- 





Fig. 3 — Stator lamination of the vernier resolver. 
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shoe. The positive winding sense for a given pole-shoe is that of the 
exciting coil. The electrical angle of a pole-shoe is its mechanical angle 
measured clockwise, with respect to a reference on the stator, multiplied 
by the number of rotor teeth (the order of this resolver). 

The winding which produces the cosine voltage, e,, consists of ten 
coils, one on each pole-shoe, connected in series. Each coil has a dif- 
ferent number of turns depending on the pole-shoe to which it belongs. 
Specifically, this number of turns is equal to a design constant ‘Vl’ 
multiplied by the cosine of the electrical angle of the pole-shoe. Similarly, 
the winding which produces the sine-voltage, e,, consists of ten coils, 
one on each pole-shoe. The number of turns of each coil is equal to the 
same constant ‘‘t”? multiplied by the sine of the electrical angle of the 
particular pole-shoe. 

With a, being the electrical angle between adjoining pole-shoes and 
with the electrical angle of pole-shoe No. 0 equalling zero, the turns of 
the coils of the cosine-winding on pole-shoes No. 0 through 9 are: 


too = # cos (0) 


tc. = t cos (a) 
(6) 
tco = tcos (Qae) . 
Similarly the turns distribution, ¢; , of the sine winding is: 

iso = t sin (0) 
ts = tsin (a) 

| (7) 
t <9 = é sin (Dae) 


ORDER OF NUMBER OF 
RESOLVER ROTOR TEETH 


26 26 
27 27 
32 32 
33 33 





Fig. 4 — Rotor lamination of the vernier resolver. 
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To obtain the voltage induced in the output coils, the flux amplitude 
for each pole-shoe must be established. Defining the electrical angle of 
the rotor, #,-, as its mechanical angle multiplied by its number of teeth 
and choosing @, to be zero when the center of a rotor tooth lines up with 
the center of pole-shoe No. 0, one can write for the flux amplitudes 
go through ¢y of pole-shoes No. 0 through No. 9: 


do = Ay + A; cos 6, + Ao cos 20, + -:- 


di = Ao + Ai COS (66 — ae) + °°: 
a Se. oe (8) 


dy = Ao + Arcos (8 — Yae) + °° 


where Ay, A1, Az,.... are the Fourier Components of ¢. 

The amplitude, EF, , of the voltage induced in the cosine winding is 
the sum of the products of the pole-shoe flux, ¢, , measured in [volt sec] 
and the coil turns, é,,, multiphed by the exciting current frequency in 
radians per second, w: | 


9 
ft, —= W = Pyley . (9) 


Substituting the values of ¢,, and ¢, from (6) and (7) and neglecting all 
higher order Fourier components of ¢, one obtains 


E, = wA, COs 6, . (10) 
Similarly one obtains for the amplitude F, of the sine voltage: 


E, = ss wA, sin 0. (11) 

As required, the two induced secondary voltages are proportional to 
the cosine and sine of the electrical rotor angle. 

As shown in the appendix, an analysis which takes into account the 
higher order [Fourier components of the pole-shoe flux shows that a 
sinusoidally distributed winding is sensitive solely to the so-called slot- 
harmonies. The order “‘m’”’ of these harmonics is given by the expression 


m=kqtl — (12) 


where “‘k”’ is any integral positive number and q is the number of pole- 
shoes divided by the largest integral factor common to the number of 
pole-shoes and the number of rotor teeth. For instance, 1n the case of a 
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ERROR IN SECONDS 





SHAFT ANGLE, Om 


Fig. 5 — Error of a 27th order vernier resolver over ; vernier interval. 


27th order vernier resolver this common factor is 1 and consequently the 
slot harmonics are of order: 9, 11, 19, 21, etc. 

The effect of the slot harmonics can be reduced by the following 
“means: 

a) Selecting the dimensions as well as the number of rotor and stator 
teeth such as to keep the higher order flux components low. 

b) Using a “skewed” rotor or stator, in which successive laminations 
are progressively displaced with respect to their angular orientation. 


III. PERFORMANCE 


Clifton Precision Products Co. built experimental resolver models of 
order 26, 27, 32 and 33 using the laminations shown in Figs. 3 and 4. 
The best results were obtained with 27th order resolvers. Their per- 
formance is described in the following sections. 


3.1 Repeatability and Accuracy 


The repeatability is better than -&3 seconds of shaft angle. 

Figs. 5, 6 and 7 show the error curves taken on a 27th order vernier 
resolver after compensating with trimming resistors for the fundamental 
and second harmonic error with respect to the vernier interval. (In es- 
sence, the effect of these trimming resistors is either to add or to sub- 
tract a small voltage to one or both of the resolver signals.) Fig. 8 shows 
an error curve before trimming.* 


3.2 Temperature Sensitivity | 
The error introduced by a temperature change of 70°C is less than 25 
seconds of shaft rotation. 


* The error curves really represent the combined error of the tested resolver 
itself plus that of the testing apparatus. 
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Fig. 6 — Error of a 27th order vernier resolver over one vernier interval. 


It may be pointed out that the housing of the tested unit was of 
aluminum. A unit with a non-magnetic steel housing should be of lower 
temperature drift, because stator stack and housing would then have the 
same temperature coefficient of expansion. 


3.3 Transformation Ratio, Input and Output Impedances 


At maximum coupling the induced output voltage 1s 0.123 times the 
exciting voltage and is leading in phase by 6°. 

The impedance of the input winding with the output windings open 
is 117 + 7 781 ohms. 

The impedance of the output windings with the primary winding 
shorted is 235 + 7 920 ohms. 

The effect of the rotor position on this impedance is hardly noticeable. 


3.4 Output Signal Distortions 
The harmonic content of the output signal at maximum coupling 1s: 


fundamental 1.7 volts 
2nd harmonic 0.2 mv 
3rd harmonic 13.5 mv 
5th harmonic 5.4 mv 


The harmonic content of the output signal at minimum coupling (null 
voltage) is: | 


fundamental 1.6 mv 
2nd harmonic 0.05 mv 
3rd harmonic 2.0 mv 
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Fig. 7 — Error of a 27th order vernier resolver over one shaft revolution. 


ERROR IN SECONDS 
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Fig. 8 — Error of a 27th order vernier resolver before trimming. 


3.5 Moment of Inertia and Friction Torque 


The moment of inertia of the rotor of a 27th order harmonic resolver 
is 63 gram cm sq. 

The maximum break-away friction torque among five units was 0.027 
in. oz. No change in this torque due to excitation of the unit could be 
detected. 
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IV. APPLICATION 


In its application, the vernier resolver is usually directly coupled to a 
standard resolver or some other coarse angle transducer. Such a system 
which represents a variable, in this case the shaft angle, in two scales, 
coarse and fine, will be called a vernier system. 

The following sections describe applications using the vernier resolver 
in an encoder, a follow-up system and an angle-reading system. 


4.1 Vernier Angle Hncoder 


A vernier angle encoder converts a shaft angle into a pair of digital 
numbers, one being the coarse and the other being the vernier number. 
This type of encoder can be built by mechanically coupling a standard 
resolver directly to a vernier resolver. The outputs of the two resolvers, 
after encoding, represent the coarse and the vernier number. 

The output of a resolver may be encoded, for instance, by the following 
method. The primary winding of the resolver is excited from an a-e 
source of, say, 400 cycles per second. The two induced secondary volt- 
ages are in phase with each other. Their amplitudes are proportional to 
the cosine and sine of the electrical rotor angle, 6, . 

These two amplitude modulated voltages are combined by means of 
two phase-shifting networks into two phase-modulated voltages. One net- 
work first advances the sine voltage by 90° and then adds it to the cosine 
voltage. The other network performs the same addition after retarding 
the sine voltage by 90°. The result is two constant amplitude voltages 
with relative phase shift of twice the electrical rotor angle. The time in- 
terval between the respective zero crossings of these two voltages 1s con- 





Fig. 9 — Resolver servo system. 
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verted into digital representation by means of an electronic stop-watch 
(time encoder). 


4.2 Vernier Follow-up System 


A vernier follow-up system can be built much like the present two 
speed synchro control-transformer system, except that the geared up 
synchros are replaced by vernier resolvers. Fig. 9 illustrates the vernier 
portion of this system. Since the output impedance of the vernier re- 
solver is fairly large, it may be desirable to use amplifiers, as shown in 
Fig. 9, to energize that vernier resolver which plays the part of the con- 
trol-transformer. 


4.3 Vernier Angle-Reading System 


A visual vernier angle-reading system as required to read the position 
of a rotary table can be built by using the output of a vernier resolver 
to position a standard resolver. 

The coarse angle can be read as usual from calibration lines marked 
directly on the rotary table. The vernier angle reading is obtained by 
coupling a vernier resolver directly to the rotary table. The output of 
this resolver is used to position a standard control transformer. This 
control transformer will go through ‘‘n” revolutions for each revolution 
of the rotary table, where ‘‘n’’ is the order of the vernier resolver. The 
reading of a dial coupled either directly or through gears to the control 
transformer provides the vernier reading. 


V. SUMMARY 


Vernier resolvers of order 26, 27, 82 and 33 have been designed, built 
and tested. 

-The construction of the unit is very simple because all windings are 
located on the stator. The absence of brushes and slip rings makes the 
unit inexpensive in production and reliable in performance. 

The performance of present experimental models is characterized by 
a repeatability of better than +3 seconds of arc and by a standard de- 
viation error over one revolution of less than 10 seconds of arc. 

Production units should be of even higher accuracy because better 
tooling fixtures would be used and minor design improvements would be 
incorporated. 

The principal forseeable application of the resolver les in vernier sys- 
tems. Vernier encoder, vernier servo and vernier angle-reading systems 
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are readily obtained by applying existing techniques to the vernier re- 
solver. 
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APPENDIX 
Symbols 
iE Amplitude of induced voltage 


py Number of pole-shoes 

nm Number of rotor teeth 

6. Electrical rotor angle, equal to its geometrical angular position mul- 
tiplied by n 

q op divided by largest integral factor common to n and p 

n’ mn divided by the same factor 

vy Pole-shoe number running from 0 to (p — 1) 

m Order of Fourier component representing the pole-shoe flux as a 


function of the electrical rotor angle, 6, 
a. Electrical angle between adjoining pole-shoes, equal to the geometri- 
cal angle multiplied by n 
kj A number equal to zero or to any positive integer. 
In accordance with equations (7) and (8) the voltage induced in the 
sine-winding coil on the vth pole-shoe by the mth flux harmonic is: 


Esmy = wlAm cos m (Oe — vae) t sin (va)]. (18) 
After trigonometric transformation: 
Esmyv = 4Amtw [sin (md, — (m —1)vae) 
+ sin (—mé + (m + 1)va,)] . 


The voltage, Hm , induced in the sine winding is obtained by summing 
the expression of (14) over all values of v. Since (pa) is a multiple of 27 
the summing of all sine terms from »v = 0 to »v = (p — 1) results in zero 
unless the angle (m + 1) a, is an integral multiple, k, of 27. This condi- 
tion is spelled out in the following equation: 


(14) 
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(m +1) a = k2r. (15) 


The electrical angle a, being the mechanical angle between successive 
pole-shoes divided by the number of rotor teeth is: 


pe (16) 
p 


Dividing p and n by the largest common integral factor, one can write 


Ae = ome. (17) 
q 


Substituting this expression into (15) and solving for m, one obtains 
m= “4 + 1 | (18) 


where m and & are integers or zero. Quai, (18) is satisfied for 
the following values of m: 


m=1; q+ 1; 29 + 13", (19) 


The amplitude of the voltage, /.m , induced in the sine winding by flux 
harmonics of order m, where m is specified by (19), is 


i 5 Amto sin (mé,). (20) 


Similarly one obtains for the voltages, H.,, induced in the cosine 
winding 


Big. = 5 Anes cos (mé.). (21) 
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